Augmented reality guidance for spinal procedures using stereoscopic optical see-through head mounted displays and tracking of instruments and devices

ABSTRACT

Embodiments disclose a real-time surgery method and apparatus for displaying a stereoscopic augmented view of a patient from a static or dynamic viewpoint of the surgeon, which employs real-time three-dimensional surface reconstruction for preoperative and intraoperative image registration. Stereoscopic cameras provide real-time images of the scene including the patient. A stereoscopic video display is used by the surgeon, who sees a graphical representation of the preoperative or intraoperative images blended with the video images in a stereoscopic manner through a see-through display.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.16/598,697, filed Oct. 10, 2019, which is a continuation of U.S.application Ser. No. 16/518,426, filed Jul. 22, 2019, now U.S. Pat. No.10,511,822, which is a continuation of U.S. application Ser. No.16/240,937, filed Jan. 7, 2019, which is a continuation of U.S.application Ser. No. 15/972,649, filed May 7, 2018, now U.S. Pat. No.10,194,131, which is a continuation of U.S. application Ser. No.14/753,705, filed Jun. 29, 2015, now U.S. Pat. No. 10,154,239, whichclaims the benefit of and priority to provisional application No.62/097,771, filed Dec. 30, 2014. The entirety of each these applicationsare hereby incorporated herein by reference.

BACKGROUND INFORMATION Field of the Invention

Embodiments are directed towards image-guided surgery, and moreparticularly CT-guided, MR-guided, fluoroscopy-based or surface-basedimage-guided surgery, wherein images of a portion of a patient are takenin the preoperative or intraoperative setting and used during surgeryfor guidance.

Background

In the practice of surgery, an operating surgeon is generally requiredto look back and forth between the patient and a monitor displayingpatient anatomical information for guidance in operation. In thismanner, a type of mental mapping is made by the surgeon to understandthe location of the target structures. However, this type of mentalmapping is difficult, has a steep learning curve, and compromises theaccuracy of the information used.

Equipment has been developed by many companies to provide intraoperativeinteractive surgery planning and display systems, mixing live video ofthe external surface of the patient with interactive computer generatedmodels of internal anatomy obtained from medical diagnostic imaging dataof the patient. The computer images and the live video are coordinatedand displayed to a surgeon in real time during surgery, allowing thesurgeon to view internal and external structures and the relationshipbetween them simultaneously, and adjust the surgery accordingly.

Preoperative or intraoperative image registration with surfacereconstruction has been done in conventional surgery navigation systemseither with a single 3D scanner device that functions at the same timeas video camera (e.g. time-of-flight cameras). These conventionalsystems display the surgeon's main viewpoint, or a video camera orstereoscopic video cameras that are used as viewpoint for the surgeonare used for processing a surface reconstruction. These conventionalsystems may enhance the surface reconstruction or image registrationwith other techniques, such as optical or infrared techniques, markers,etc. However, these systems are limited in the availability of precise3D surfaces, in their precision and speed of image registration ofpreoperative or intraoperative image with the 3D surfaces, and inblending such registered images with the viewpoint of the surgeon.

Accordingly needs exist for more effective systems and methods thatcombine real-time preoperative images with virtual graphics associatedwith the preoperative images, wherein the combination of thepreoperative images and virtual graphics is displayed on a stereoscopic,see through, head mounted display.

SUMMARY OF THE INVENTION

Embodiments disclosed here describe a real-time surgery navigationmethod and apparatus for displaying an augmented view of the patientfrom the preferred static or dynamic viewpoint of the surgeon.Embodiments utilize a surface image, a graphical representation theinternal anatomic structure of the patient processed from preoperativeor intraoperative images, and a computer registering both images.Responsive to registering the images, a head mounted display may presentto a surgeon an augmented view of the patient, wherein the augmentedreality is presented via a head mounted display.

Embodiments disclosed herein include a stereoscopic camera system. Thestereoscopic camera system may be configured to provide real-timestereoscopic images of a target portion of the patient. In embodiments,the stereoscopic camera system may include a 3D scanner system that isconfigured to determine location data and orientation data, wherein thelocation data and orientation data are determined in reference to acommon coordinate system.

Responsive to the stereoscopic camera system recording media, anddetermining the location data and orientation data, a stereoscopic viewof the 3D volume image may be output to a stereoscopic display to thesurgeon in real time. The stereoscopic view of the 3D volume image maybe blended in a same position as the patient appears in the stereoscopicvideo images during surgery. The stereoscopic view of the 3D volumeimage are displayed in the preferred manner, e.g. using backgroundsubtraction techniques, the 3D volume image appearing over the patientas background model, the hands and instruments appearing as foregroundobjects.

Embodiments may be configured to assist in real time during surgery,wherein the stereoscopic view of the 3D volume image is presented in asurgeon's field of view in a stereoscopic manner, e.g. graphicalrepresentations of instruments tracked, surgical guides or techniques,anatomical models, etc. as needed. Accordingly, utilizing thestereoscopic view of the 3D volume image, the surgeon may be able tomake adjustments to the stereoscopic view of the 3D volume image. Forexample, the surgeon may modify the stereoscopic view of the 3D volumeimage by selecting a transparency, color and contrast of each imagelayer displayed, using an available real-time user interface means,which may include gesture recognition methods.

Embodiments may be independent devices and processes for each main taskprovided during surgery: surface reconstruction and image registration,stereoscopic video and stereoscopic image registration. Embodiments mayalso be configured to provide an enhanced depth perception throughbackground subtraction methods, and real-time user interaction, whichmay change the separation of the stereoscopic video cameras, adjustingthe position of the registered 3D volume, displaying the 3D volume in aprecise manner, adapting for pose change detected in the surface,adjusting the degree of transparency, color and contrast, etc.

Embodiments disclosed herein disclose systems that are configured torecord stereoscopic video with at least two mounted cameras. The mediarecord by the cameras may be in the field of view of a surgeon.Utilizing a head-mounted display, the surgeon may freely move in theoperating room, keeping the desired field of vision defined by theposition and orientation of the mounted cameras. With the mountedcameras and the head mounted display, the surgeon would be able to viewthe media recorded by the mounted cameras.

Virtual graphics may be added to the media recorded by the two cameras.Responsive to the virtual graphics being added to the recorded media,the surgeon may be presented on the head-mounted display a preoperativeimage, such as a 3D volume image of a previous CT. The preoperativeimage may be presented, recorded, or registered (referred to hereinaftercollectively and individually as “registered”) over the patient, in realtime. Thus, the internal anatomical structures of the patient may beblended with the media recorded by the mounted cameras.

In embodiments, tracking may be configured to be added to instruments orimplants within the preoperative image, wherein virtual graphics areassociated with views inside the patient presented to the surgeon.Accordingly, embodiments may be configured to register preoperativeimages blended with virtual graphics over a target portion of a patient,wherein the blended images are presented over a visual field of asurgeon.

In embodiments, an intermediate 3D surface may be obtained by surfacereconstruction via 3D scanners. The intermediate 3D surface may be usedfor registration with a 3D volume obtained by volume rendering via imagedata from a CT or MR scan. The 3D volume image of the patient may beautomatically located in real time to the positioned of the patientbased on a common coordinate system between the stereoscopic cameras,head mounted display, the virtual graphics, and/or the 3D surface. The3D volume image may be any surface rendering of a preoperative image.

Tracking a 3D scanner's virtual camera and the mounted camera to thecoordinate system, may define where an augmented view may be positionedon the head mounted display. Accordingly, the preoperative images may beutilized without markers, which may allow for more flexible and quickerregistration.

These, and other, aspects of the invention will be better appreciatedand understood when considered in conjunction with the followingdescription and the accompanying drawings. The following description,while indicating various embodiments of the invention and numerousspecific details thereof, is given by way of illustration and not oflimitation. Many substitutions, modifications, additions orrearrangements may be made within the scope of the invention, and theinvention includes all such substitutions, modifications, additions orrearrangements.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting and non-exhaustive embodiments of the present invention aredescribed with reference to the following figures, wherein likereference numerals refer to like parts throughout the various viewsunless otherwise specified.

FIG. 1 shows a block diagram of the present invention;

FIG. 2 shows a perspective view of a surgery navigation system of thepresent invention;

FIG. 3 shows a flow diagram of a method of the present invention;

FIG. 4 shows a perspective view of the surgery navigation system of thepresent invention;

FIG. 5 shows a perspective view of the surgery navigation system of thepresent invention;

FIG. 6 shows a block diagram depicting a computing device of the presentinvention.

Corresponding reference characters indicate corresponding componentsthroughout the several views of the drawings. Skilled artisans willappreciate that elements in the figures are illustrated for simplicityand clarity and have not necessarily been drawn to scale. For example,the dimensions of some of the elements in the figures may be exaggeratedrelative to other elements to help improve understanding of variousembodiments of the present disclosure. Also, common but well-understoodelements that are useful or necessary in a commercially feasibleembodiment are often not depicted in order to facilitate a lessobstructed view of these various embodiments of the present disclosure.

DETAILED DESCRIPTION OF THE INVENTION

In the following description, numerous specific details are set forth inorder to provide a thorough understanding of the present embodiments. Itwill be apparent, however, to one having ordinary skill in the art thatthe specific detail need not be employed to practice the presentembodiments. In other instances, well-known materials or methods havenot been described in detail in order to avoid obscuring the presentembodiments.

FIG. 1 shows an exemplary embodiment of the surgical navigation system.Surgical navigation system 100 may include devices configured to createa 3D rendering of a region of interest.

Using computer means 100, volume data of a patient scanned with apreoperative imaging 102 or an intraoperative imaging 106 device (e.g.CT scanner) is rendered as a 3D volume image using a volume renderingtechnique 104 and stored for processing, wherein the volume data isassociated with a volume of the patient. Preoperative 102 andintraoperative images 106 are also stored as digital images 108 forprocessing.

While computing means 110 is scanning the volume data, 3D scanner system110 may be configured to capture a 3D surface 112 of the target portionof the patient 118, and a stereoscopic camera system (e.g. pair ofcameras) 114 may be configured to obtain a stereoscopic video 116 of thescene, including the target portion of the patient 118.

Registration of the 3D volume and the 3D surface 120 is performed bycomputer means 100, as is the registration of the stereoscopic videowith the 3D surface 122. In embodiments, registration of 3D volume 104and stereoscopic video 116 is completed through an intermediateregistration of both images with the 3D surface image 112 into a commoncoordinate system,

The images are processed 124 and sent to the stereoscopic display 126used by the surgeon 128. The registration and other image processing 124by computer means 100 is adjusted by the surgeon 128 or other usersthrough general user interface means 130 before surgery, or throughreal-time user interface means 132 during surgery, e.g. by interactionof the surgeon 128 with a device capable of gesture recognition (e.g.the same 3D scanner system 110 and/or stereoscopic display system 126).

User interface means 130, 132 may include graphical user interface means(e.g. virtual graphics like buttons) displayed within the surgeon's 128view, through a 3D display 126, which may be a virtual reality,augmented reality, etc. device. Augmented reality (AR) help 134 is addedto the processed images sent to the stereoscopic display system 126.

Tracking means 136, such as optical markers, may be attached to thepatient 118 providing anatomic landmarks of the patient during thepreoperative 102 or intraoperative images 106, and during the 3Dscanning process for registration of the 3D volume with the 3D surface120. The optical markers, alone or in combination with other trackingmeans, such as inertial measurement units (IMU), may be attached to the3D scanner system 110, stereoscopic camera system 114, the surgeon 128(e.g. in the head-mounted stereoscopic display 126), as well as to anyinstruments and devices 138 (e.g. screws, plates, pins, nails,arthroplasty components, etc., broaches, screwdrivers, motors, etc.)used by the surgeon 128. Utilizing tracking means, systems may offerreal-time data for a more precise location and orientation of images andobjects in the common coordinate system used. The stereoscopic video 116may be sent directly to the stereoscopic display 126, without undergoingimage processing 124, if the surgeon 128 or other users select thatoption using the available interface means 130, 132.

FIG. 2 shows an embodiment of an example of the perspective view of thisnavigation system in a model operating room. As depicted in FIG. 2, apair of video cameras 201, 202 (corresponding to the stereoscopic camerasystem 114 in FIG. 1) may be configured to capture a stereoscopic view116 of a region of interest. In FIG. 2, a 3D scanner 203 (correspondingto the 3D scanner system 110 in FIG. 1) is positioned proximate to thestereoscopic cameras 201, 202, in a known location and orientation anglewith respect to them. The cameras 201, 202 and 3D scanner 203 areattached e.g. to an articulated mechanical arm 204 that is suspendedfrom the ceiling 205. A second 3D scanner 206 (also part of the 3Dscanner system 110), e.g. with motion tracking capabilities, is attachedto another mechanical articulated arm 207.

The 3D scanners 203, 206 capture a 3D surface 112 of the target portion208 of the patient 209 located on a surgical table 210. Images and dataobtained from 3D scanners 203, 206 is stored and processed in thecomputer 211 (corresponding to the computer means in 100 in FIG. 1) usedfor image processing, and the surgeon 212 and other users are able tointeract with it through the available interface means 130, 132 (e.g.gesture recognition, or mouse and keyboard). The processed images 124(e.g. stereoscopic video 116 blended with stereoscopic 3D volumetricimages 104 of the internal anatomical structures) are displayed to thesurgeon 212 wearing the head-mounted 213 stereoscopic display 214(corresponding to the stereoscopic display system 126 in FIG. 1) in realtime. Instruments 215, 216 are tracked e.g. by markerless optical means(e.g. using the 3D scanners 203, 206 in real time), by optical meansusing markers 217, by inertial measurement units 218, or a combinationof these and other tracking means.

To the surgeon 212, the internal structures of the patient 209 appeardirectly superimposed on the target portion of the patient 208 with theselected transparency, or the selected layers of the stereoscopic 3Dvolumetric images appear blended with the stereoscopic video 116, oronly the 3D volumetric images 104 are shown, or any other stereoscopicimages from the sources available are shown to the surgeon 212, modifiedby any software-implemented change, like zoom, color or contrastadjustments, depending on the previous 130 or real-time interaction 132to modify the image processing 124.

Responsive to surgeon 212 moving his or her head around to view thespatial relationship of the structures from varying positions, computermeans 100 may be configured to provide the precise, objectiveregistration between the digital images of the internal structures 104,108 and the surface reconstruction 112 of the portion of the patient208. This in situ or “augmented reality” visualization gives the surgeon212 intuitively based, direct, and precise access to the imageinformation in regard to the surgical task. Headphones 219 include soundto the augmented reality experience of the surgeon 212, who can hear thedifferent steps of the surgery as it develops, or the advice of otherprofessionals in a different place, who may be seeing the same imagesdisplayed to the surgeon 212, thanks to online communication.

FIG. 3 shows a flow diagram depicting an embodiment of a method 300 forintegrating preoperative 102 and stereoscopic video 116 images. Theoperations of method 300 presented below are intended to beillustrative. In embodiments, method 300 may be accomplished with one ormore additional operations not described, and/or without one or more ofthe operations discussed. Additionally, the order in which theoperations of method 300 are illustrated in FIG. 3 and described belowis not intended to be limiting.

At operation 302, the computer 100 receives preoperative image data(e.g. CT scan) 102.

At operation 304, the preoperative image data is processed into a 3Dvolume by a volume rendering technique (which is the same as the volumerendering 104 in FIG. 1). The volume rendering technique may be basede.g. on segmentation of the volume data of the anatomic internal partsof the portion of the patient 118.

At operation 306, one or more graphical representations of the imagedstructures may be provided. Operation 306 may be performed with userinteraction at Operation 308 (before or during surgery), using theavailable user interface means 130, 132.

At operation 310, a 3D scan of the patient may be received.

At operation 312, a precise surface reconstruction of the patient (whichis the same as the surface reconstruction 112 in FIG. 1) may beconstructed automatically.

At operation 314, the surface reconstruction of the patient may beadjusted with input from the surgeon 128 or other users.

At operation 316, the 3D volume rendering and the surface reconstructionmay be compared by automated means.

At operation 318, the pose of the patient may be determined duringsurgery.

At operation 320, the 3D volume may be adjusted according to thedetermined pose. More specifically, the 3D volume may be adjusted sothat the orientation of the 3D volume as displayed to the surgeonmatches the orientation of the portion of the patient on which thesurgeon is operating.

At operation 322, the surgeon or other users may be able to correct, inreal time, the pose.

At operation 324, the computer may receive stereoscopic video (which isthe same as the stereoscopic video 116 in FIG. 1) from the stereoscopiccamera system.

At operation 326, tracking means attached to the 3D scanner system 110,stereoscopic camera system 114, patient 118, imaging devices 106, andinstruments 138, may send data to the computer 100. The computer mayprocess and store the received data 326. For example, the received datamay include 3D surface and stereoscopic video.

At operation 328, the 3D surface data and data associated with thestereoscopic video may be compared.

At operation 330, orientation tracking data of each of the cameras thatform the stereoscopic camera system may be obtained. The obtainedorientation tracking data may be used in combination with theappropriate tracking means 136 for location.

At operation 332, a view of the 3D surface may be displayed to thesurgeon, wherein the view corresponds to the precise position of eachcamera in the common coordinate system.

At operation 334, patient tracking data may be compared with instrumentor device tracking data, e.g. with data from inertial measurement unitsand optical markers placed on the patient 118, and/or instruments 138.

At operation 336, the patient tracking data may be received by thecomputer.

At operation 338, the patient tracking data may be automaticallyprocessed, which may be configured to locate all tracked objects withinthe common coordinate system. Utilizing the common coordinate system andthe patient tracking data, a visualization associated with the patienttracking data within the common coordinate system may be displayed tothe surgeon or other users in real time, e.g. in numeric or graphicformat. Accordingly, the position of the cameras forming thestereoscopic camera system 114 may be dynamically determined, andutilizing the position of the cameras the precise location andorientation of the 3D surface model with respect to the view of eachcamera 114 may be determined. The position of the data recorded by eachcamera 114 in the common coordinate system corresponding to the positionof the virtual cameras may offer a view of the 3D surface.

At operation 340, a blended image output may be presented on thestereoscopic display 126. At operation 342, registration of the 3Dvolume and the stereoscopic video is done with help from an intermediate3D surface reconstruction. This process may be fully automatic bysoftware means, or may be performed with help from user interaction.

At operation 344, the registration of the 3D volume and the stereoscopicvideo may account for pose changes of the patient, and it is displayedto the surgeon according to the precise location of the cameras 114, sothat there is a seamless integration of all views (e.g. video, 3Dsurface and 3D volume) to both eyes.

At operation 346, the images are displayed stereoscopically to thesurgeon 128 in real time. The images may depict an augmented realityimage or “augmented view.”

At operation 348, the augmented reality image may include notificationhelps 348.

At operation 350, the surgeon 128 or other users may perform actions toinput data into to the system in real time during surgery using theavailable user interface means 132. Thus, modifying the image processing124 and customizing the view shown in the display 126.

Returning to FIG. 2, the stereoscopic camera system 114 captures astereoscopic view of the scene, wherein the stereoscopic camera system114 is comprised of a pair of cameras 201, 202. The cameras 201, 202 maybe positioned at a certain distance from each other. This distance isadjusted depending on the interpupillary distance of the surgeon 212,and on the distance from the cameras 201, 202 to the target field ofview, usually centered on the target portion of the patient 118. Thecomputer 211 receives the video images 116 and sends them to thestereoscopic display 219, with or without modifications through imageprocessing 124, such as additional virtual graphics, zoom, color orcontrast adjustments.

In embodiments, as depicted in FIG. 2, the pair of cameras 201, 202 maybe attached to an articulated mechanical arm 204 that is e.g. suspendedfrom the ceiling 205. In this exemplary embodiment, the 3D scanner 203is in a fixed position close to the cameras 201, 202, so that locationand orientation tracking between the devices is not necessary.Alternatively, the 3D scanner 206 may not be in a fixed position withrespect to the cameras 201, 202, and is e.g. attached to anotherarticulated mechanical arm 206 suspended from the ceiling 207.

Turning now to FIG. 4, the cameras 401, 402 are head-mounted 403 alongwith the stereo display 404, and worn by the surgeon 405. The 3D scanner406 is attached to an articulated arm 407, or alternatively, a 3Dscanner is also head-mounted close to the cameras 401, 402. Anadditional 3D scanner 408 (e.g. with motion tracking capabilities, as atime-of-flight camera) is e.g. attached to an articulated mechanical arm409 e.g. suspended from the ceiling, or placed on the ground, or in anyother suitable location.

Turning now to FIG. 5, the cameras 501, 502 are mounted on a pole 503attached to a belt 504 worn by the surgeon 505, who sees the scenethrough the head-mounted stereo display 506. The 3D scanner 507 isattached to the same pole 503. Alternatively, or in combination with theother embodiments, a 3D scanner is attached to an articulated mechanicalarm e.g. suspended from the ceiling, or placed on the ground.Alternatively, a 3D scanner is mounted on tripods standing on theground. Alternatively, the camera system 114 is composed of a single,two-dimensional camera, used in combination with a camera from the 3Dscanner system 110. It will be understood that the above-describedexamples of the number and alternative positions of cameras composingthe stereoscopic camera system 114 (as well as the number and positionof devices composing the 3D scanner system 110) are presented for thepurpose of example, and are not intended to be limiting in any manner.

In embodiments, the surgeon 212 is able to move and rotate the cameras201, 202 during surgery. This is done for example through interactionwith the computer system 211 that controls the mechanical arm 204movement, e.g. with gesture recognition by the stereoscopic display 214(e.g. virtual reality display), or by the 3D scanner 203 (e.g.time-of-flight camera with motion tracking), or with voice commands byvoice recognition from an appropriate device. A third person can alsointeract directly with the computer 211, e.g. according to the surgeon's212 commands.

In embodiments the stereoscopic camera system 114 is mobile, fitting theevolving needs of the surgery through changes in location andorientation. Or alternatively the cameras 114 are placed fixed in acertain part of the operating room as previously designed, to fit thepotential needs of the surgeon 128.

In embodiments, the cameras composing the stereoscopic camera system 114may have special lenses (e.g. fisheye lenses, or wide-angle lenses) andthey are arranged e.g. vertically side by side, to obtain a wider viewand allow for the use of a virtual reality display, from thosecommercially available, according to each virtual reality device'spreferred camera configuration. The movement of the surgeon's 128 headis tracked by the device, and the field of view of the surgeon 128 canchange, while the cameras remain fixed.

In embodiments, the stereoscopic camera system 114 is composed of threeor more cameras, used to obtain different stereoscopic views, e.g.arranged in circle for a combined 360° view (for example in combinationwith a virtual reality display), or e.g. arranged following the outlineof the target portion of the patient 118. The stereoscopic video 116 ofthe scene displayed to the surgeon 128 changes by selecting a differentcamera pair through computer means 100, without the need to move thecameras, for example automatically when the surgeon 128 traverses acertain location, or by interaction of the surgeon 128 or a third personthrough the real-time user interface means 132.

The stereoscopic camera system 114 may be configured to provide the sameimages for all operating surgeons, or alternatively each surgeon mayhave a different stereoscopic video image displayed to him or her, oralternatively some surgeons share a view and others have individualviews, depending on the needs of the surgery and the preferences of theindividual surgeons.

Returning now to FIG. 1, the preoperative 102 or intraoperative image106 information, may be received from a CT scan or an MR scan, as wellas ultrasound, PET, and C-arm cone-beam computed tomography.Preoperative X-ray and stereoscopic X-ray images, intraoperativefluoroscopic and stereoscopic fluoroscopic images, and other imagemodalities that allow for stereoscopic and 3D representations are alsoused, either directly in a digital format 108, or as graphical 3Dvolumetric representations 104 of the image data.

The volume rendering may be a set of techniques used to display atwo-dimensional (2D) projection of a 3D discretely sampled data set,typically a 3D scalar field. Computer means 100 may be configured toprocess the 3D data set (e.g. a group of 2D slice images acquired by aCT or MRI scan) with a volume rendering 104 technique, and provide oneor more graphical representations of the imaged structures, which mayinclude the 3D volume image representing the anatomic internal parts ofa portion of the patient 118.

The volume rendering 104 techniques include 3D surface rendering, 3Dvolume rendering, as well as fusion, parametric mapping, andmulti-object rendering. More precisely, the volume rendering 104 may bedone using one or more of the available methods, such as direct volumerendering (e.g. volume ray casting, splatting, shear warp, texture-basedvolume rendering, etc.), maximum intensity projection,hardware-accelerated volume rendering, and any of the availableoptimization techniques (e.g. empty space skipping, early raytermination, volume segmentation, image-based meshing, pre-integratedvolume rendering, etc.). Such methods may be further helped by the userby using regions (ROI) or volumes of interest (VOI), lookup tables(LUT), and any of the available methods for refining the volumerendering 104. It will be understood that these examples of volumerendering methods and their outputs are presented for the purpose ofexample, and are not intended to be limiting in any manner.

For example, 2D slice images are acquired by a CT scan, a virtual camerais defined in space relative to the volume, iso-surfaces are extractedfrom the volume and rendered e.g. as polygonal meshes, or directly as ablock of data. In this context, a graphical representation may be a dataset that is in a “graphical” format (e.g. stereolithography or STLformat), ready to be efficiently visualized and rendered into an image.The surgeon 128 or other users can selectively enhance structures, coloror annotate them, pick out relevant ones, include graphical objects asguides for the surgical procedure and so forth. This pre-processing maybe completed offline, in preparation of the actual real-time imageguidance, through the available general user interface means 130, or itmay be done (or the previous preparation adjusted) during surgery, usingthe available real-time user interface means 132. In embodiments,virtual cameras that provide the 3D volume image may be positioned inspace relative to the volume data, wherein the virtual cameras arecorrespondingly positioned to the common coordinate system, e.g. in thesame position as the cameras composing the stereoscopic camera system114, and/or in the same position as the devices composing the 3D scannersystem 110, to allow for an easier and quicker registration of images,according to the different embodiments described.

In embodiments, two-dimensional images (e.g. fluoroscopic intraoperativeimages, X-rays, ultrasonographic images, etc.) are also used as 3Dvolume images of the target portion of the patient 118, using any of theavailable methods, e.g. 2D-3D reconstruction using statistical shapemodels, as e.g. a distal femur 3D reconstruction from registration of anX-ray imaging (e.g. lateral and anteroposterior projections) of thefemur, with a 3D statistical shape model of the distal femur, withadjustments made by the surgeon 128 or other user, through the availableuser interface means 130, 132.

In embodiments, that 2D-3D registration of intraoperative orpreoperative 2D images is done with help from volume rendering 104 ofthe available images (e.g. from CT scan or MR scan of the target portionof the patient 118) or from the surface reconstruction 112 of anatomicalstructures of the patient done during surgery. For example, in a pelvisfracture involving a hemipelvis, registration of the 2D or stereoscopicfluoroscopic intraoperative images 106 of the affected hemipelvis isdone e.g. with a statistical shape model of a hemipelvis, or with the 3Dvolume of the CT scan of the healthy contralateral hemipelvis, or acombination of them, with interaction of the surgeon 128 and other usersin real time, to obtain a precise 3D graphical representation of thefracture reduction obtained during surgery, before proceeding with thedefinitive fixation. In embodiments, a 3D scanner system 110 may becomposed of one or more devices capable of capturing shapes of objects(and sometimes its appearance, e.g. color), usually outputting a pointcloud as a data file, allowing for the construction of three-dimensionalsurface models of the object scanned. For example, a 3D scanner system110 may include a laser scanner, a time-of-flight 3D laser scanner, astructured-light 3D scanner, hand-held laser scanner, a time-of-flightcamera, a depth camera, or a combination of these or other devices.

The 3D scanner 110 may be moved around the target portion of the patient118 to obtain a precise 3D surface image of it through surfacereconstruction 112. For example, computer means 100 may receive a dense3D point cloud provided by the 3D scanning process, that represents thesurface of the target portion of the patient 118 by a point cloudconstruction algorithm, e.g. by the intersection of two lines emanatingfrom the camera center, or from the midpoint of a line perpendicular tothese two lines. After the data acquisition, some type of smoothing ofthe data (e.g. Gaussian) may suppress random noise.

In embodiments, the point cloud obtained is used directly forregistration purposes, e.g. by comparing it with point clouds used torepresent volumetric data from preoperative 104 or intraoperativeimaging 106. In embodiments, computer means 100 make use of techniquesof surface reconstruction 112 (e.g. Delaunay triangulation, alphashapes, ball pivoting, etc.) for converting the point cloud to a 3Dsurface model (e.g. polygon mesh models, surface models, or solidcomputer-aided design models). The surface reconstruction 112 is fullyautomated by computer means 100, and can be assisted 314 by the surgeon128 or other users through the available user interface means 130, 132.

The 3D scanning and surface reconstruction 112 of the target portion ofthe patient 118 is made when the surgeon 128 desires the 3D scanning andsurface reconstruction, through interaction with software means 132.Alternatively, 3D scans are made after a predetermined amount of timehas passed since the last scan. Alternatively, 3D scans are madewhenever computer means 100 detect movement of the portion of thepatient 118 (i.e. determining patient pose 318), e.g. by comparing asurface reconstruction obtained from the stereoscopic video 116 to thelast surface reconstruction 112 of the 3D scanner 110, or bymarker-based optical tracking, or by IMU tracking, or any other trackingmeans 136.

In embodiments, multiple 3D scanners are used to make a reconstructionof the surface 112 of the target portion of the patient 118 without aneed for movement of the devices composing the 3D scanner system 110, orlimiting the movement needed for each device. In another embodiment,multiple 3D scanners are also used for optical tracking 136 ofinstruments and devices 138, and for determining their location andorientation. In another embodiment, a handheld 3D scanner is used, e.g.a portable 3D scanner forming part of the 3D scanner system 110 is heldin the hand and moved around the target portion of the patient, and oncethe desired surface is obtained, it is placed in a previously definedfixed position.

In embodiments, the 3D scanner system 110 is composed of a dedicatedstereoscopic camera system, composed of at least a pair of 2D videocameras. The stereoscopic camera system may be configured to capturestereoscopic video images, wherein the cameras are positioned in adesired position with respect to the other cameras forming the system toobtain the more precise surface reconstruction. The two-dimensionalimages taken by the cameras are converted to point clouds or meshsurfaces, or both, using known methods for surface reconstruction, forexample range imaging methods (e.g. structure-from-motion). Such surfacereconstruction from images of two-dimensional cameras may be used alone,or in combination with a more precise 3D scanner to make the surfacemodel 112 obtained (and its location and orientation angle with respectto the stereoscopic camera system 114), more accurate.

In embodiments, the 3D scanner system 110 is composed of time-of-flightcameras and/or other motion tracking devices, which are used for gesturerecognition by software means, allowing for interaction of the surgeon128 or other users with the computer 100 during surgery through thereal-time user interface means 132, without touching anything in theoperating room. Alternatively or in addition to gesture recognition,voice commands are used to avoid touching the computer 100 or otherdevices included in this invention and controlled by computer means 100.

In embodiments, the 3D scanner system 110 is composed of devices capableof real-time scanning (e.g. time-of-flight cameras), which are used toobtain an instant, real-time registration of the 3D surface 112 with the3D volume 120 and the stereoscopic video 122 throughout the surgery.Blending that real-time 3D surface 112 with one acquired with a moreprecise device (e.g. 3D laser scanner), a more adaptable, real-timeregistered surface reconstruction is obtained, offering an instant,high-quality image registration during surgery.

In embodiments, the actual shape of the patient's body may be configuredto be rendered into a 3D volume, such that fitting the overall shape ofthe 3D surface model to the patient 118 results in fitting of the modelinternal anatomy to the patient 118.

The 3D/3D registration, such as the 3D volume—3D surface registration120, as well as 2D/3D registration, is done by computer means 100. Inembodiments, rigid registration methods are used, such asgeometry-based, paired-point, surface-based, intensity-based, etc. Inembodiments, nonrigid registration methods are used, e.g. feature-based,intensity-based, etc. In embodiments, biomechanical models, such asstatistical shape models, are incorporated into the registration method.In embodiments, registration is done with the help of optical markers,e.g. color or reflective markers in certain predefined anatomicallandmarks of the patient 118 for the 3D scanner system 110,corresponding to markers of the same size and shape placed on the samepredefined anatomical landmarks of the patient 118 during thepreoperative 102 or intraoperative imaging 106 (e.g. radiopaque markersfor CT or x-ray imaging), or virtual markers placed on the predefinedanatomical parts in the graphical representation of the images obtained,in the preoperative or intraoperative setting, through the availableuser interface means 130, 132. In other embodiments, a combination ofrigid and nonrigid registration methods are used.

For example, a markerless registration method may be used. Morespecifically, an output of the surface reconstruction 112 is stored inpoint sets or depth maps. The markerkless registration may be completedby applying known methods (e.g. the iterative closest point algorithm,the Curie point depth algorithm, or the scale invariant featuretransform algorithm) to the output of volume rendering 104 (e.g. 3Dvolume image). This markerless registration may be completed by 2D orstereoscopic digital images 108 from previous imaging studies 102 orintraoperative images 106 (e.g. CT scans or MR scans).

Point matching is done either directly or with previous transformationsto obtain more accurate outputs, e.g. calculating Gaussian curvaturefrom the depth map, or excluding outliers (e.g. with random-sampleconsensus) during point matching. With the matching results, aligningthe two coordinate systems is computed with software (e.g. Procrustesanalysis). For a more precise registration, the preoperative images 102are adjusted to the software needs (e.g. through segmentation by ahistogram-based threshold). It will be understood that theseregistration methods are presented for the purpose of example, and arenot intended to be limiting of the actual 3D volume—3D surfaceregistration 120 method used in any manner.

In embodiments, registration 120 is done in a fully automated manner bycomputer means 100. In embodiments, the surgeon 128 or other users mayadjust 322 the 3D volume—3D surface registration 120 in real time duringsurgery through the available user interface means 132, as describedbelow in the different embodiments. In embodiments, the 3D volume—3Dsurface registration 120 is made before surgery matching the volumerendering 104 to a preoperatively done surface reconstruction 112 of thetarget portion of the patient 118 (e.g. in the most likely position thatthe patient 118 will have during surgery), and then a real-time 3Dsurface—3D surface registration is done during surgery.

Embodiments may be configured for real-time correction of patient posein the 3D volume rendering 320. For example, the 3D scanner system 110obtains the surface reconstruction 112 of a target portion of thepatient 118, the surface being e.g. the superficial skin of the patient118, and the 3D surface is blended by computer means 100 with the 3Dvolume of the preoperative imaging 102 (e.g. CT scan) of the patient118. The blended part being e.g. the 3D volume of the skin surface,preferably by using preselected constant anatomic landmarks, e.g. theprominent spinous processes in the skin surface of the patient's 118spine.

The internal anatomic structures are therefore displayed based on thepositioning of patient 118 was when the preoperative image 102 wasobtained, e.g. with the patient 118 in the supine position during the CTor MR scan. Responsive to performing a medical procedure on the targetanatomic structures, and e.g. bony structures or soft tissues are(partially) exposed, a new 3D scan is obtained with the 3D scannersystem 110. The new 3D surface obtained of the visible bones and softtissues is blended with the 3D volume of bone and soft tissuestructures, which may result in a more accurate registration of bothimages 120. This may be due to skin and subcutaneous fat tissue of thepatient 118 changing more than internal anatomic structures during thetime passed between the preoperative imaging 102 and the actual surgery.

The 3D surface is therefore used to determine the pose of the portion ofthe patient 118 during surgery 318, and adjust the presentation of the3D volume, which is based upon a pose 320, so that the location andorientation of the 3D volume as displayed to the surgeon 128 matches thelocation and orientation of the body part of the patient 118 on whichthe surgeon 128 is operating. That adjustment of the pose changecorrection 320 is done by automatically processing new surfacereconstructions 112 in real time. Alternatively, creating pose changecorrections 320 is done in fixed intervals; alternatively, it may bedone at the surgeon's 128 discretion Alternatively, creating pose changecorrections 320 is done whenever deeper dissection has been carried out,or when the patient 118 has moved, as detected e.g. by tracking means136. The surgeon 128 or other users are able to correct the pose changeadjustments 320 made in real time 322 through the available userinterface means 132.

In embodiments, markers are attached to the selected anatomic landmarksof the patient 118, to define and locate its position in the commoncoordinate system. Thus computer means 100 can calculate changes inposition, and when a predetermined minimum (angle or distance) thresholdis crossed, it rescans the portion of the patient 118 and constructs anew 3D surface 112.

In embodiments, pose changes detected 318 in the surface reconstruction112 are translated into the 3D volume image. The detected pose changesmay be translated by adjusting the supposed movement of the internalanatomic mobile parts. The supposed movement of the internal anatomicmobile parts may be detected from external pose changes when anatomicparts are individualized in the 3D volume image, which may be donepreoperatively, e.g. during or after the volume rendering 104 ofpreoperative images 102, using the available user interface means 130.The movement of the body is tracked and translated into the movement ofinternal anatomic parts of a virtual anatomical (e.g. skeletal) model306, with the help of computer means 100. In that manner, instead ofblending a 3D surface with a static 3D volume image directly, the posechange 318 in the 3D surface is interpreted by computer means 100 andapplied 320 to a predefined virtual anatomical model 306, to which the3D volume of the patient is registered, obtaining more precise positionsof the internal anatomic parts, e.g. location and orientation of jointsand bones by using an anatomic skeletal model.

For example, if the target portion of the patient 208 is the spine, thevolume rendering 104 of a previous CT scan may obtain individuallyrendered parts. A virtual 3D anatomic skeletal model 306 of a spine ispreviously developed, being mobile, and translating variations in trunkposition into e.g. movement of the different vertebrae. By registrationof the individualized vertebrae of the 3D volume images and thevertebrae of the virtual 3D anatomical model 306, a virtual anatomicavatar of the patient's spine is developed. Pose changes in surfacereconstructions 112 are recognized by software means 318, and areautomatically translated into movement of the individualized parts inthe 3D anatomical avatar 320. Thus, usually supine position of thepatient 118 during CT scan image is automatically turned into the mostcommon prone position of the patient 118 in the operating room. This maybe performed by translating changes to the sagittal position of thereconstructed 3D surface 112 of the spine to the position of eachindividual vertebra and intervertebral disk in the anatomical model,which may be the 3D volume of each vertebra and intervertebral disk.

In that manner, the system may also be configured to determine changesin lateralization and rotation of the trunk in the 3D surface into theposition of individual vertebrae, vessels and nerves in the 3Dvolumetric image. In this manner, bone structures are more preciselylocated when dissecting through soft tissue (to achieve the bestpossible exposure), or when targeting bony structures, as in positioningof transpedicular screws. For example, when dissection is carried out tothe level of bone, and individual vertebrae are seen, a new surfacereconstruction 112 more accurately delimits the position and rotation ofprocesses, laminae, pedicles and posterior body of the vertebrae. Hence,the blending of 3D surface 112 with virtual 3D anatomical model, andregistration with 3D volume 120. Using appropriate segmentation andsoftware that takes into account such variations, provides a moreprecise location and orientation of each individual bony and soft tissuestructure, including adjacent vascular and neurological structures atrisk.

As another example, in percutaneous surgery for fractures, anindividualized volume rendering 104 is done of each fracture fragment inthe injured portion of the patient 118, as well as of the correspondinghealthy part, usually the contralateral limb or hemipelvis. The computer100 may be configured to determine pose changes in the different surfacereconstructions 112 of external and internal structures in the injuredpart. The computer may be configured to translate these movements to the3D rendered fracture fragments in the virtual anatomical model of theinjured part, changing their position in the virtual representation toaccurately reproduce their real-time location and orientation. Reductionof fragments is then adjusted with the help of the target virtualanatomical model of the healthy part (e.g. a skeletal virtual model),which is output superimposed with the desired level of transparency tothe injured part.

Intraoperative images 106 in combination with computer vision methodsand real-time user interface means 132 help the surgeon 128 position the3D volume image of individual fragments where they are in real time, asseen on the stereoscopic video 116.

When developing the virtual anatomical avatar for any location of thebody, known detailed muscles, ligaments, etc., bone models are used totranslate movements as precisely as possible. Such complex dynamicanatomical models are received and stored in the computer 100 fromavailable dynamic simulation models, or they may be newly created basedon the specialized literature. To apply these simulation models to adetailed 3D anatomical model, available models are stored and used, or anewly created 3D anatomy model may be created for that purpose.

In the exemplary embodiment described above in FIG. 2, the stereoscopiccameras 201, 202 and the 3D scanner 203 are closely arranged in aprecise fixed and known relative position to each other, so that theprecise relative location (x1,y1,z1) and orientation angle (α1,φ1,θ1) ofthe first camera 201 with respect to the 3D scanner 203, and therelative location (x2,y2,z2) and orientation angle (α2, φ2,θ2) of thesecond camera 202 with respect to the 3D scanner 203 are known. Therelative locations may be used by the computer 211 to automaticallyselect the position of virtual cameras for stereoscopic views of the 3Dsurface, according to the precise position of the cameras 201, 202relative to the coordinate system used by the 3D scanner 203. Thevirtual cameras that define the views of the 3D surface reconstruction112 change their location and orientation angle simultaneously as thedevices composing the stereoscopic camera system 114, automatically bycomputer means 100, with help from data acquired from tracking means 136on the cameras.

In embodiments, the position of the 3D scanner system 110 may be thereference position of the 3D scanner devices when constructing a surfacemodel 112 with respect to the object (e.g. the target portion of thepatient 118), according to its own coordinate system. This referenceposition is therefore selected as the virtual camera position from whichthe 3D surface model is seen in its own coordinate system, and may bethe initial or the final position in the object scanning process, or itmay be dynamic and moves with the scanner device (e.g. a time-of-flightcamera), or it may be any other alternative position selected by thecomputer means 100 that processes the 3D scan. In the embodiments, thestereoscopic cameras 201, 202 and the 3D scanner 203 may remain close toeach other in a fixed position. This may offer the most precise possiblesurface reconstruction 112 from the perspective view of the stereoscopicvideo 116 obtained, and to do the stereoscopic video—3D surfaceregistration 122 with the least effort on the side of computer means 100of the system, so that registration is quick and user adjustments neededare limited to the minimum.

In embodiments, the 3D scanner 206 is separated from the stereoscopiccameras 201, 202. In embodiments, their relative position is fixed andknown, and the same principles described for fixed positioning apply. Inembodiments, multiple scanners and/or video cameras are used, or theyare mobile, and their relative orientation angle is determined e.g. byIMUs attached to all devices, which show their orientation relative toeach other. In embodiments, the stereoscopic camera system 114 ismobile, and the same 3D scanner system 110 functions as markerlessoptical tracking system, determining the relative position of thecameras forming the stereoscopic camera system 114, if these are in thefield of view of the 3D scanner 110.

In embodiments, the 3D scanner system 110 is used in combination withmultiple optical markers placed on the stereoscopic camera system 114,for the precise real-time tracking of its location and orientation.Alternatively, or in addition to the 3D scanner system 110, opticaltracking means 136 are used for an accurate relative positioning ofcameras 114. Optical tracking is made with a tracker camera and markersthat locate the 3D scanner devices 110 and stereoscopic cameras 114 inthe common coordinate system, which may be based on 3D-2D pointcorrespondences.

In embodiments, the device or devices composing the 3D scanner system110 include one or more cameras (e.g. time-of-flight cameras). Thecamera or cameras may form part of the stereoscopic camera system 114.Registration of 3D surface and stereoscopic video 122 is therefore donedirectly within the coordinate system of the device.

In embodiments, the cameras forming the stereoscopic camera system 114are used to process a surface reconstruction of the portion of thepatient 118, for example using range imaging techniques (e.g.structure-from-motion). That surface reconstruction has a relativeposition that is determined with respect to the cameras forming thestereoscopic camera system 114. The surface reconstruction may be usedfor registration 122 with the 3D surface obtained by the 3D scannersystem 110, which may reduce the need for other tracking devices.

In embodiments, this surface reconstruction obtained from images of thestereoscopic camera system 114 is used directly for 3D volume—3D surfaceregistration, and this registered image is used for comparison with the3D volume—3D surface registration 120 made with the surfacereconstruction 112, to more precisely define the precise position of the3D volume in the coordinate system.

In embodiments, once the initial 3D volume—3D surface registration 120and stereoscopic video—3D surface registration 122 are done, the trackedlocation and orientation angles of the two-dimensional cameras 114 withrespect to the portion of the patient 118 (e.g. tracking the selectedanatomic landmarks of the patient 118) are used as the location andorientation parameters for the virtual cameras defined for capturing the3D volume images. In this manner, 3D volume—stereoscopic videoregistration is done directly, without an intermediate surfacereconstruction 112. This direct 3D volume—stereoscopic videoregistration is further adjusted through real-time user interface means132, and it may be combined with other images through image processing124, according to the different embodiments.

In embodiments, tracking means 136 may include a tracking camera thatworks in conjunction with active or passive optical markers that areplaced in the scene. In embodiments, the tracking camera may be part ofthe 3D scanner system 110. In embodiments, tracking means 136 includepassive or active optical markers that work in conjunction with thetracking camera. Different kinds of tracking systems may be employed,either alone or combined, such as magnetic tracking, inertial tracking,ultrasonic tracking, electromagnetic tracking, etc. Mechanical trackingis possible by fitting the joints of the mechanical arm 204 attached tothe ceiling 205 with encoders.

In embodiments, known optical markerless or marker-based trackingsystems are used and their data processed 326 by computer means 100 forthe tracking of location and/or orientation of the instruments anddevices 138, the patient 118, the imaging devices 106, the surgeon 128,and more precisely they are used for image processing 124, e.g. for 3Dvolume—3D surface registration 120, or for 3D surface—stereoscopic videoregistration 122, or even for direct registration of 3D volume withstereoscopic video 116, either alone or in combination with the otherembodiments described. Such image registration examples involve alsointeraction of the surgeon 128 or other users with the computer 100,through the available user interface means 130, 132.

In embodiments, the surgeon 128 uses a stereoscopic display 126 and canexamine the spatial relationship between the anatomical structures fromvarying positions.

Utilizing the stereoscopic display, the surgeon 128 may not need to lookback and forth between monitors and patient 118, and to mentally map theimage information to the patient 118. As a consequence, the surgeon 128can better focus on the surgical task at hand and perform the operationmore precisely and confidently.

In embodiments, the display 126 may be a simple stereoscopic videodisplay, or alternatively it may be a stereoscopic virtual realitydisplay, that allows for interaction of the surgeon 128 or other userswith the virtual environment created by computer means 100, e.g. throughmotion tracking and gesture recognition. In embodiments, as shown inFIG. 2, the stereoscopic display 214 is head-mounted 213, and it mayinclude headphones 219 and a microphone, to receive information in audioformat and be able to communicate with other users.

In embodiments, the stereoscopic display 126 is composed of a virtualreality device, and the stereoscopic camera system 114 is composed offixed cameras, or cameras that do not move with the head of the surgeon128, so that the stereoscopic view displayed to the virtual realitydevice moves as the head of the surgeon 128 moves, using its owntracking means 136 (e.g. optical marker-based tracking), without theneed to change the position of the cameras 114 to change the view. Forexample, if the stereoscopic camera system 114 obtains a 360° view ofthe target portion of the patient 118, when the surgeon 128 moves his orher head, that movement is tracked by the head-mounted display 126,changing the perspective view accordingly. Alternatively, the display126 may not be head-mounted, but e.g. a 3D monitor mounted on anarticulated mechanical arm, on a fixed pole, or any other suitablesupport, either fixed or mobile. Alternatively, the display 126 is atwo-dimensional display, and 2D video images 116 are displayed.

The stereoscopic camera system 114 displays a real-time stereoscopicimage to the surgeon 128 through the display system 126, which isconnected to the computer 100 by wired or wireless (e.g. Bluetooth orWi-Fi) means, depending on the needs of surgery and on the possibilitiesof the operating room. Computer means 100 integrate information from thedifferent devices used as described in the different embodiments, anddifferent combinations of stereoscopic video 116, 3D surface, and 3Dvolume images are shown in a stereoscopic manner to the surgeon 128,with the modifications, additions and deletions as adjusted by thesurgeon 128 or other users in real time, so that the surgeon 128 seesthe information that he or she wants (from the data available) at anytime during the procedure.

In embodiments, the stereoscopic display 126 is a virtual reality devicethat allows for gesture recognition (e.g. through motion tracking),displaying the real-time user interface means 132 within the field ofview of the surgeon 128. Such virtual graphics displayed within thefield of view of the surgeon 128 allow for interaction with the computer100. For example, the surgeon 128 may see virtual buttons in a marginalposition of his or her field of view, which may be pressed with apredetermined hand gesture. A virtual button, when pressed, may allowthe surgeon 128 e.g. to select a virtual image (e.g. a 3D volume image),and translate and rotate it with his or her hand movements, to adjustits position with respect to the stereoscopic video 116. In embodiments,voice recognition included in the display 126 or in another deviceconnected to the display 126 through computer means 100 allows e.g. toshow the virtual graphics (e.g. buttons) only when saying theappropriate word or words, so that virtual graphics for interaction donot interfere with surgery.

In embodiments, the stereoscopic video 116 may be directly sent to thedisplay 126, either by wired or wireless means. Through user interfacemeans 130, 132, the surgeon 128 can select that the stereoscopic video116 be directly sent to the display 126 after being received by thecomputer 100. Alternatively, the surgeon 128 may select that the videosignal from the stereoscopic cameras 114 be sent directly to the display126, through a direct wired or wireless connection between both devices.The stereoscopic video images 116 are received by the computer 100 forimage processing 124, e.g. for the stereoscopic video—3D surfaceregistration 122. The use of a direct connection between the video 116(or the cameras 114) and the display 126 makes the time lag negligiblefor practical purposes during surgery. Integration of the imagesprocessed 124 with the real-time stereoscopic video 116 sent directly tothe display 126 may allow for real-time user interaction via userinterface means 132. For example, the display 126 may show theregistered 3D volume blended with the real-time stereoscopic video 116image directly sent to the stereoscopic display 126, instead of thestereoscopic video image 116 processed 124 by computer means 100. Asanother example, the display 126 may switch to the images processed 124by computer means 100 only when the surgeon 128 allows for it, e.g. whenmore time lag is acceptable.

In an alternate embodiment, a stereoscopic optical see-through displayis used as display system 126 from those commercially available (e.g.optical see-through glasses), and real-time images of the patient 118are directly available to the surgeon's 128 point of view, instead of(or in combination with) the stereoscopic video images 116. The 3Dvolume blended with the 3D surface is tracked in space to the locationand orientation of the stereoscopic display 126, by way of trackingmeans 136 (e.g. markerless optical tracking and IMU in both devices).Alternatively, the tracked position of the head of the surgeon 128 isused to define the location and orientation of the virtual camerasoffering the stereoscopic view (or the virtual camera offering a 2Dview) of the 3D volume image. Thus, allowing for an instant registrationof the surgeon's 128 direct vision of the portion of the patient 118 andthe 3D volume image, that may be adjusted with the available real-timeuser interface means 132.

In an embodiment, the display system 126 is a projector which projectsthe processed images 124 over the target portion of the patient 118. Inorder to achieve a seamless integration with the surgeon's 128 view ofthe patient 118 during surgery, tracking means 136 are used toaccurately track in real time the position of the projector and of thehead of the surgeon 128, relative to the common coordinate system.Computer means 100, taking into account the location, orientation andlens-characteristics of the projector, send the blended images that areprojected over the patient 118, so that the images appear to the surgeon128 as a property of the target portion of the patient 118, with thedesired adjustments in transparency, color, contrast, etc. Inembodiments, the projector projects a stereoscopic view over the patient118, which is viewed by the surgeon 128 wearing the correspondingglasses, e.g. glasses with polarized filters for a projector that uses apolarization stereoscopic display system (a type of “passive”stereoscopic system).

In embodiments, changes in position of the surgeon's 128 head aretracked with a head-mounted virtual reality or any other stereoscopicdisplay 126 (the preferred stereoscopic video display for thisinvention, or alternatively the optical see-through display or projectoras described), so that the augmented reality view 346 offered to thesurgeon 128 changes in real time. In embodiments, to limit the time lagof the stereoscopic video images 116, the virtual graphics provideddirectly by computer means 100 using tracking 136 and software means aredisplayed directly to the surgeon's 128 field of view following his orher tracked head position. The virtual graphics are displayed alone,e.g. the registered 3D volume and 3D surface images, hence limiting thetime lag effect, and guiding the surgeon 128 in the actual scene that ishappening in the operating room. Alternatively, the virtual graphics aresent directly to the display 126, but combined with the availablestereoscopic video images 116 (sent directly to the display 126, orafter undergoing image processing 124), sacrificing a more preciselyblended image in exchange for less time lag with respect to the realscene.

Recording means allow one to record all images received by the computer100, and the augmented view 346 displayed to the surgeon 128.

In embodiments, the augmented view 346 may comprise a real view blendedwith virtual graphics. The real view is provided as stereoscopic videoimages 116 of the scene. The virtual graphics is derived from computermeans 100, e.g. 3D volume or digital images 108 of preoperative images102, generally a CT scan, or an MR scan, or a combination of them. Inthis case the virtual graphics also correspond to views of real anatomicstructures, available to the surgeon 128 only as computer graphicsrenderings.

The real view of the external structures and the virtual view 346 of theinternal structures are blended with the help of a surfacereconstruction 112, as already described, and they are shown in-situwith an appropriate degree of transparency, which may vary as the fieldof view changes. Registration between real and virtual surfaces makesall structures in the augmented view 346 be positioned in the correctlocation with respect to each other, therefore the derived image of theinternal anatomical structure is directly presented in the surgeon'sworkspace in a registered fashion.

The image display of the 3D volume obtained through volume rendering 104is the virtual 3D representation of a volume data set as it is“flattened” onto one or more 2D planes. Different techniques areavailable using software-based and hardware-based solutions that mayoptimize image accuracy, speed, quality, or a combination of them.Nonlimiting examples of techniques for 3D volume image display includeray casting, fly-through, multiple views, obscured structure, shadingdepth cues, kinetic and stereo depth cues. Such techniques are availablein the computer means 100, and are selected by the surgeon through theavailable user interface means 130, 132.

According to the embodiments described for surface reconstruction 112,the position of the devices that compose the stereoscopic camera system114 and/or 3D scanner system 110 is dynamically tracked to display theprecise location and orientation of the 3D surface model with respect tothe view of each camera 114, which may provide a blended image output340 to the display 126.

Therefore, when 3D volume—3D surface registration 120 is presented, astereoscopic view of the 3D volume is presented 342 (either fullyautomatically by software or with user interaction 350). The 3D volumemay take into account the pose changes of the patient 344, and it isdisplayed to the surgeon 128 according to the precise location of thecameras 114. Accordingly, there may be a seamless integration of allviews (e.g. stereoscopic video 116, 3D surface and 3D volume) to botheyes as an augmented view 346 of the surgical field. The same principlesapply to all images displayed stereoscopically to the surgeon 128, whenfitting the reconstructed 3D surface of the portion of the patient 118,as well as to the different augmented reality helps 346 displayed, suchas notifications 348. For example, when using a pair of two-dimensionalcameras as the stereoscopic camera system 114, the virtual cameras (thatdefine stereoscopic views of the 3D surface) are positioned by computermeans 100 within the common coordinate system to the correspondingposition of the two-dimensional cameras (and thus the specificinterpupillary distance selected by the surgeon 128), each image beingdisplayed to each corresponding eye of the surgeon 128 by thestereoscopic display 126, together with the corresponding stereoscopicvideo image 116. The surgeon 128 and other users are able to define oradjust the position of the virtual cameras defining the 3D volume and/orthe 3D surface using the available general user interface means 130, andthe real-time user interface means 132 during surgery.

In embodiments, the virtual cameras based on the position of thestereoscopic camera system 114 are also used to create views of the 3Dvolume for registration with the 3D surface, and the stereoscopic viewof the registered image is automatically blended with the stereoscopicview of the registered 3D surface—stereoscopic video 122. Alternatively,the virtual cameras defined are used for direct registration with thestereoscopic video 116. Alternatively, the virtual cameras are based onthe position of the head of the surgeon 128, and they are used to beshown in the optical see-through display, or alternatively they areprojected over the patient 118, according to the different embodimentsof the present invention.

Once the initial position of the virtual cameras is determined, computermeans 100 translate their location and orientation changes in real time,with data acquired from tracking means 136, e.g. according to thecorresponding position of the devices forming the stereoscopic camerasystem 114. Therefore, the position of the device or devices composingthe stereoscopic camera system 114 is tracked, and changes in locationand rotation (relative to their initial position during registration)are translated to the position of the corresponding virtual camera orcameras. In this manner, when using a mobile stereoscopic camera system114, the surgeon 128 sees a direct, seamless view of the inner anatomicstructures of the patient 118 blended with the stereoscopic video 116that defines his or her basic view. As another example, when usingtracking data from a head-mounted stereoscopic display 126 (e.g. virtualreality device), changes in its position are translated to the positionof the virtual cameras defined, so that the perspective view of thevirtual graphics (e.g. the 3D volume) changes to fit the view of thesurgeon 128.

In embodiments, especially in settings where preoperative volumetricdata is not available, and simple X-ray images and intraoperativefluoroscopic X-ray images are frequently used, a mirror system is usedattached to the imaging device (e.g. a C-arm) that obtainsintraoperative images 106, making the video optical center of the 3Dscanner device 112 (e.g. a time-of-flight camera) or the video camera114 virtually coincide with the X-ray source. The depth sensor from the3D scanner device 110 used (e.g. a time-of-flight camera) needs to beadjusted to the distance of X-ray source to the detector, and also tothe distance from the mirror. In embodiments, a stereoscopic C-arm isused that contains two X-ray sources and detectors, each source with avideo camera 114 or a 3D scanner device 110, or both, attached to themin the described manner. Image registration is done according to theprinciples of the present invention, whereby the surgeon 128 seesreal-time stereoscopic video images 116, and the intraoperative X-rayimages (either stereoscopic or not) blended with the video images in itsprecise location over the patient 118.

In embodiments, the digital images 108 or the 3D volume of preoperative102 or intraoperative 106 images, after being registered with thestereoscopic video 116 images, are displayed blended with thestereoscopic video 116 by means of classical alpha blending. Duringalpha blending, the registered digital images or virtual graphics aredirectly superimposed to the stereoscopic video 116 images, and thesurgeon 128 sees the digital image 108 or 3D volume over the visiblescene, with the selected transparency level, color adjustments, etc.

In other embodiments, the digital images 108 or the 3D volume ofpreoperative 102 or intraoperative 106 images, registered with thestereoscopic video 116, are displayed blended with the stereoscopicvideo 116 by means of real-time background subtraction methods. Inreal-time background subtraction methods the foreground objects in thevideo or 3D surface images are detected from the different frames (byusing reference frames), and thus a background image or model isobtained. For example, in an embodiment using a time-of-flight camera as3D scanner system 110, multiple real-time images with color and depthinformation are obtained, and an algorithm (e.g. random forests) isapplied to the pixels. The pixels may be classified to belong to anobject class, either foreground objects (including e.g. the surgeon's128 hands and instruments 138) or the background model (including thepatient 118), to obtain a probabilistic output of the objects. Theseobjects are identified as label maps, creating then a pixel-wise alphamap, and then using a mixing look up table (LUT) that associates aspecific alpha value to each label pair.

Once the mixing LUT is obtained, higher values are given to surgicalinstruments 138 and surgeon's 128 hands over the background, giving abetter depth perception. As another example, foreground objects aresimilarly identified and classified from RGB data of stereoscopic video116 or 3D scanner 112 images, especially metallic, bright, thin- andsmall-shaped instruments or devices 138 (e.g. drill, clamps orscalpels), which are not well reflecting infrared light. A combinationof IR- and RGB-based information is calibrated in advance, according tothe tools used and the light conditions, and can be further adjusted inreal time through the available user interface means 132.

In embodiments, the 3D surface corresponding to the foreground objects,obtained by the background subtraction methods applied to the 3D surfaceimages, is superimposed to the registered and blended stereoscopic video116 and 3D volume image, using the virtual camera position of eachstereo camera 114 to give a perspective view of the surgeon's 128 handsand instruments 138 corresponding to each eye, giving a more realisticaugmented view.

In an embodiment, the identified foreground objects from thestereoscopic video 116 are superimposed over the blended 3Dvolume—stereoscopic video images. In an embodiment, the precise locationof the background model in the common coordinate system is used tosuperimpose the 3D volume over the background model located on each ofthe stereoscopic video 116 images, leaving the foreground objects (e.g.surgeon's 128 hands and instruments 138) as the original stereoscopicvideo, without superimposed images.

These foreground objects can then be adjusted in transparency, color,etc. to permit the surgeon 128 to see the virtual graphics through them,e.g. the internal anatomy of the patient 118 in the registered 3D volumeimage, or the graphical representation of tracked instruments anddevices 138.

In embodiments, markers are placed in the target portion of the patient208 (the target background model), or the surgeon's 128 hand orinstruments 138 (the target foreground objects), or in both locations.These markers are tracked by the tracker camera (e.g. forming part ofthe 3D scanner system), and help obtain a quicker and more preciseposition of objects. In embodiments, background subtraction methods makeuse of hardware or software capabilities (e.g. motion detection) of the3D scanner devices 110, the 3D display 126, or both, to locate theforeground or moving objects.

In embodiments, to take into account real-time, small adjustments in themoving foreground objects, an area surrounding the foreground objects inthe target video image is defined by the user interface means 132. Thiswill adjust for minimal real-time movements of the hand or instruments,leaving the background model bigger or smaller than it actually is,depending on the preferences of the surgeon 128. This defined area canbe modified in real time (augmented or diminished) by the surgeon 128 byusing the real-time user interface means 134.

In embodiments, a motion controller is used, which can be a differentdevice from those described, or one or more of those used inembodiments, such as the 3D scanner device 110 (e.g. time-of-flightcamera) or the stereoscopic display 126 (e.g. virtual reality display),using their own computer-implemented software for gesture recognition.Gesture recognition is used as real-time user interface means 132 e.g.to more accurately adjust the position of the stereoscopic cameras 114,the registration of 3D volume image and stereoscopic video 116, thesurface reconstruction 112 (or its parameters), the image processing 124(e.g. stereoscopic views of the 3D volume and other images), and anyother possible software-controlled task of this invention, avoiding theneed to touch a device (e.g. mouse or keyboard). For example, whenadjusting pose changes 320, gesture recognition may also be used toadjust the position of the different individual bony or soft tissuestructures in the 3D volume—3D surface registration 120, in thestereoscopic video—3D surface registration 122, or in any other imageprocessing 124 task. In embodiments, voice recognition is used tointeract with the computer 100 through spoken commands, alone or incombination with other computer interface means.

Gesture or voice recognition allows the surgeon 128 to adjust the levelof transparency of each superimposed image and their order ofvisualization; to select the exact layer of the 3D volume image to show;to do with gestures or spoken commands any software-implemented taskideally without touching any device, although other interaction meansare available, such as mouse and keyboard. Gesture or voice recognitionare activated via a specific gesture, voice command, or by other users,so that it is not active all the time, to avoid real-time interpretationof normal hand movements and normal speech during surgery.

An optional remote user interface allows an additional user to see andinteract with the augmented view during the system's real-timeoperation, as described in this invention. For example, the surgeon 128may show another person the points to adjust, or tell this person whatto change, and this person directly or indirectly interacts with thecomputer 100 to make the changes, as in the nonlimiting examples ofaugmented reality described below.

In embodiments, the registered 3D volume image is displayed to thesurgeon 128 automatically by layers, e.g. each layer may correspond tothe estimated depth that the surgeon's 128 instruments and devices 138have achieved. Thus, when dissection is carried out e.g. through thesubcutaneous tissue, the superficial layers of the 3D volume image arealso automatically made fully transparent, and the deepest layerachieved with the scalpel is shown, or layers above or below the deepestlayer are shown with transparency degree as determined by the surgeon128 through the available interface means 132, to obtain the mostintuitive and useful augmented view of the target portion of the patient118. As another example, only the part of the 3D volume image layer thathas been dissected and the selected margin width are made transparent,or other augmented reality modifications are made to them, while theregion outside the surgical wound remains in the original state.

In embodiments, the maximum depth achieved by the instruments or devices138 is obtained by optical tracking from the 3D scanner device 110. Forexample, depth is estimated from the 3D surface reconstructed 112according to the device's own coordinate system, in real time (e.g. witha time-of-flight camera). In embodiments, depth calculation is enhancedby using previously made 3D models of the instruments or devices 138,hence more accurately tracking its actual depth and orientation withsoftware-based calculations. In embodiments, depth is calculated bycomputer means 100 from the stereoscopic video images 116 (e.g. rangeimaging techniques as in stereo triangulation, or stereophotogrammetrictechniques). In embodiments, marker-based optical tracking is used forreal-time location and orientation of instruments and devices 138. Inembodiments, IMUS attached to the instruments and devices 138 helptracking their real-time orientation. Alternatively, depth is obtainedby other tracking means 136 or a combination of them, as described inthe different embodiments.

In embodiments, layers of the 3D volume image, blended with thestereoscopic video 116 in the preferred manner, is given a certainpercentage of transparency. For example, beginning with 50% for theupper layer (that is the deepest layer achieved e.g. by the scalpel),and being increased following a certain increasing pattern for deeperlayers. That way, the surgeon 128 is able to know or more preciselyimagine (according to his or her own knowledge) which internalstructures may lie ahead and be at risk if dissection is carried outdeeper or to the sides of the surgical wound.

In embodiments, intraoperative images 106 (e.g. fluoroscopic images) aredisplayed directly to the surgeon 128, usually in a marginal positionrelative to his or her field of view. In embodiments, intraoperativeimages 106 are tracked to the position of the patient 118 with the helpof markerless optical tracking, or alternatively using optical markers(e.g. placed on selected anatomic landmarks of the patient 118, and onthe imaging device 106), or by other tracking means 136 or a combinationof them, as described in the different embodiments. For example,intraoperative CT scans or MR scans are tracked to the patient 118, oralternatively e.g. with the help of optical markers on the patient 118and the CT or MR scanner 106, and/or with a tracker camera. As anotherexample, stereoscopic fluoroscopic images taken intraoperatively (eitherwith a specialized image intensifier, or by translating or rotating theimage intensifier between images), are shown as a stereoscopic pair ofimages to the display 126, each image to the corresponding eye of thesurgeon 128, either tracked previously to the position of the targetportion of patient 118 or not. Alternatively, intraoperative images 106like fluoroscopic images and CT scans or MR scans are indirectly trackedto the patient 118, automatically by software using the alreadyreconstructed and registered 3D volume—3D surface 120 and 3Dsurface—stereoscopic video 116, and assisted manually by the surgeon 128or other users by interaction with the computer 100. Such markerlessregistration using other precise digital images 108 (e.g. CT scan) orthe processed output based on them (e.g. 3D volume image) allows forreal-time correction of scale of the fluoroscopic image, either fullyautomatically or adjusted by interaction of users with the softwarethrough the available user interface means 130, 132. In embodiments,tracking means 136 for location and orientation of the intraoperativeimaging device 106 (e.g. image intensifier) and the selected anatomiclandmarks of the patient 118 give enough data for automated calculationby computer means 100 of the actual size of the imaged structures, sothat a precise positioning of the fluoroscopic images is done with e.g.the surface reconstruction 112. The same principles apply to otherintraoperative image sources 106, e.g. arthroscopy or ultrasound.

In embodiments, registration of the stereoscopic video 116 is done witha 3D atlas model of the target part of the human body, that consistse.g. of another patient's 3D volume image, or a volume rendering of realanatomic slices (e.g. the Visible Human Project's corresponding male orfemale dataset), or stereoscopic videos previously recorded (e.g. fromsimilar exposures and surgical techniques), or a 3D virtual anatomicatlas designed in a computer, etc. or a combination of such atlasmodels. Such registration is done automatically in the most accuratecorresponding place over the target portion of the patient 118,according to the principles of this invention, and with help frominteraction of the surgeon 128 or other users, either preoperatively orduring surgery, e.g. adjusting the size and position of the virtualpatient to the real patient 118, the position of their internalstructures, etc. using the general user interface means 130. Inembodiments, the 3D atlas models are previously processed, e.g. clearlymarking important anatomic landmarks and structures at risk, and alsodifferentiating the individual parts and making them movable, to adjustfor pose changes in the patient 320, applying the principles of thisinvention. The registration is also adjusted in real time, as thesurgery develops, using the real-time user interface means 132.

In embodiments, only the 3D surface obtained with a 3D scanner 110 isused (without registration with 3D volumetric image of the patient 120),and thus only registration between the 3D surface and the stereoscopicvideo 122 is done. Measurements of distances and angles areautomatically calculated by computer means 100 from the 3D surfacemodel, with or without interaction by the surgeon 118 or other usersthrough the available user interface means 130, 132. For example, theappropriate location and angulation of the tibial cut in a total kneearthroplasty, or the CORA in a proximal tibial osteotomy, are calculatedand displayed graphically in real time to the surgeon 118, using e.g. asurface reconstruction 112 of the lower limb or limbs of the patient118, with or without intraoperative imaging 106, according to theprinciples of this invention. Measurements and calculations based on the3D surface may therefore be improved by doing new surfacereconstructions 112 when achieving deeper layers of dissection, e.g.when exposing distal femoral and proximal tibial bone during kneearthroplasty, and also when a 3D model atlas (e.g. of a knee) is usedfor registration of 3D surface—3D model atlas during surgery, accordingto the principles of the present invention.

In embodiments, optical tracking markers and IMUs are also placed in theselected landmarks of the patient 118, and/or in instruments 138, tooffer a more precise positioning of the internal structures of thetarget portion of the patient 118, for example in cup positioning duringtotal hip arthroplasty. In embodiments, measurements and calculationsare made directly over the stereoscopic video 116, with range imagingtechniques (e.g. stereo triangulation, or structure-from-motion) orstereophotogrammetric techniques, alone or in combination with the otherembodiments.

In embodiments, the registered 3D volume is adjusted during surgery,according to the images taken intraoperatively 106, such as fluoroscopicimages. For example, after reduction of fracture fragments, by comparingthe newer fluoroscopic images to the previously obtained 3D volume, eachfragment individualized in the 3D volume is translated and rotated tothe most exact current position, either automatically by computer means100 (e.g. by computer vision software) or generally with interaction ofthe surgeon 128 or other users through the real-time user interfacemeans 132.

In an alternate embodiment, intraoperative images 106 (e.g. multiplefluoroscopic projections, stereoscopic or not) are registered with thereconstructed 3D surface. Thus, for example, fluoroscopic images takenintraoperatively are displayed to the surgeon 128 in the correspondingplanes with respect to the position of the target portion of the patient118, blending it with the stereoscopic video 116, and with the preferredtransparency, color and contrast values, so that the appropriate entrysite location and orientation angle for screw or pin placement is moreeasily and intuitively determined, e.g. in fractures of the pelvis,after reduction of the fragments, or in scoliosis surgery, for screwpositioning.

In embodiments, the general user interface 130 and the real-time userinterface 132 allow the surgeon 128 and other users to control the imageprocessing 124 before and during surgery, i.e. to control the augmentedreality help 134 sent to the display 126. It allows the surgeon 128 tointeractively change the augmented view, e.g. invoking an optical ordigital zoom, switching between different degrees of transparency forthe blending of real and virtual graphics, show or turn off differentgraphical structures, etc.

In embodiments, a graphical representation of the instruments anddevices 138 used during surgery is available as virtual graphics in thecomputer 100, and may be selected through the available general userinterface means 130, or through the real-time user interface means 132during surgery. That graphical representation of the instruments anddevices 138 is available as a 3D virtual representation (e.g. in STLfile format). The 3D virtual representation is obtained directly fromthe manufacturer; or automatically acquired as a 3D surface by the 3Dscanner device 110 (either done before or during surgery), or bysoftware from the video images 116 (e.g. by range imaging techniques);or as a 3D volume rendered 104 from a CT or MR scan data; or asregistration of 2D radiographic or fluoroscopic imaging with 3Dstatistical shape models of similar instruments or devices; or as asimple graphical representation drawn or designed with the known sizeand shape and added to computer means 100 using the general 130 orreal-time user interface means 132.

In embodiments, motion tracking software from the 3D scanner device 110(e.g. time-of-flight camera) or from the stereoscopic display 126 (e.g.virtual reality device) automatically recognizes the size and shape ofthe instrument or device 138 used, and automatically selects thecorresponding shape and sizes as the virtual graphics for the instrumentor device 138. In embodiments, automatic recognition of size and shapeof instruments and devices 138 is done by computer means 100 fromstereoscopic images taken by the stereoscopic camera system 114 (e.g. byrange imaging techniques as stereo triangulation, orstereophotogrammetric techniques), or from stereoscopic radiographicimages (e.g. by Roentgen stereophotogrammetry). The real-time locationand rotation of the instruments or devices 138 represented as virtualgraphics are tracked according to the different embodiments alreadydescribed, and blending of the virtual graphics with the availableimages is done using the common coordinate system, through imageprocessing 124 (e.g. with background subtraction techniques) as alreadydescribed, with interaction from the surgeon 128 and other users.

For example, during percutaneous surgery of e.g. pelvis fracture, whenreduction is achieved and a pin is inserted as a guide for thedefinitive screw, both the pin and the screw have a virtualrepresentation in shape, size, and length, and their rotation andorientation is tracked with the available tracking means 136. Therefore,when the pin or screw enters the inner structures of the body, agraphical representation of a pin or screw with its corresponding shapeand size, and with its real-time location and orientation, is displayedto the surgeon 128 in its precise position in the common coordinatesystem, e.g. blended with the 3D volume image (which is in turn blendedwith the stereoscopic video 116), according to the principles of thisinvention, assigning the desired transparency level, color adjustment,etc. In this manner, the surgeon 128 directly sees an intuitivegraphical representation of the inner anatomic structures of the targetportion of the patient 118, and of the instruments or devices 138inserted, in their precise real-time position.

In embodiments, the preoperative images 102 and the real-time videoimages 116 are processed, via classification, to identify structures inthe images. Various statistical image-based classification methods allowimages of healthy anatomical structures to be distinguished fromunhealthy structures (e.g. diseased, malignant, torn/broken/ruptured,etc.), and therefore aid in the identification of conditions bystereoscopic imaging. Functions for performing such classifications maybe trained using a training set of image data comprising a variety ofdifferent tissue conditions for each anatomical structure of interest toallow conditions of anatomical structures to be distinguished. It willbe understood that such processing may occur at the time of imageacquisition and storage (e.g. prior to surgery for the preoperativeimages 102), and/or may be performed at the time of surgery.

In embodiments, image rendering principles and algorithms are applied tothe real 2D or stereoscopic video images 116, or to the 3D surface, todetect structures at risk, using color data or texture patternrecognition software, to automatically identify certain structures bytheir usual color or texture, such as nerves or vessels. They are thendisplayed by computer means 100 automatically through image processing124 as determined by the surgeon 128 or other users through availableuser interface means before 130 or during surgery 132, e.g. in enhancedor different colors (e.g. bright red for arteries), to alert the surgeon128 of their existence and position. In embodiments, such recognition ofstructures at risk is done with the help of registration with the 3Dvolume, e.g. with the previous classification of anatomical structuresat risk, either fully automatically through software with statisticalimage-based classification methods, and/or through interaction of thesurgeon 128 or other users with the computer 100, using the availableuser interface means 130, 132.

In embodiments, markerless motion capture software is used forrecognition of moving structures (e.g. vessels), and these structuresare displayed by computer means 100 automatically through imageprocessing 124 to the surgeon 128 with a predefined alert notification,e.g. color or contrast change, or showing the suspected (arterial vs.venous) flow, similar to the visual graphics aids shown in Dopplerultrasound. In embodiments, real-time ultrasound or Doppler ultrasounddevices are used for identification of structures, blending imagesaccording to the principles of this invention (e.g. with tracking of theultrasound device to the patient 118 and/or to the common coordinatesystem, and placing the ultrasound images in their corresponding plane),as already described, so that the precise location and orientation ofstructures is done to the images available through image processing 124.

In embodiments, motion capture software used for human movementrecognition is used for skeletal tracking (e.g. with a time-of-flightcamera with the appropriate software), and with a previously definedanatomic model (and software for registration of the anatomic model withthe 3D volume), each pose change in the patient's 118 skeleton isautomatically tracked by software means, and automatic registration of3D volume—3D surface 120 is done, either as a whole or with anatomicparts individualized, as already described above in the differentembodiments.

In embodiments, tracking of joint movement of the patient 118 is usedfor defining the center of rotation of each joint, and the location andorientation of each joint in their different defining planes. Suchinformation is displayed as numbers and as virtual graphics directly andin real time to the surgeon 128. For example, when applying anarticulated elbow external fixator, the most precise center of rotationof the elbow of the patient 118 is identified in real time by thesoftware during surgery, so that the best flexion/extension arc of theelbow is permitted after the surgery.

In embodiments, where color or light changes are used for theidentification of internal anatomic structures of the patient 118, as influorescence-guided surgery (e.g. 5-ALA PDT in malignant gliomas), evenwhen the eye cannot see the color change under certain circumstances,computer means 100 recognize it from the stereoscopic video 116, andimage processing 124 is applied (e.g. changes in color or contrast) tothe video image displayed to the surgeon 128, to make such changesvisible for the surgeon 128 under the usual light. In this manner, theneed to change ambient light and observe fluorescence, change ambientlight again to continue the surgery, change light and observefluorescence again, and so on is avoided. Accordingly, surgical time maybe minimized and precision is enhanced.

In embodiments, the computer 100 is connected to an intranet or theInternet, allowing for interactive communication with other people. Theaugmented view displayed to the surgeon 128 is provided to devicesconnected to the computer 100, so that the augmented view provided tothe surgeon 128 is shared. In this manner, the surgeon 128 and otherusers (e.g. an observer or associate) may communicate, the other userswatching the augmented view on a monitor, stereo monitor, a head-mounteddisplay, or any other 2D or stereo display. For example, the augmentedview can be observed by a staff when the resident is operating, or byanother expert like a radiologist, pathologist or oncologist. They canperform actions to enter data, such as by way of an interface to thecomputer 100 (mouse, keyboard, Trackball, etc.) or e.g. through gesturerecognition software, certain features to the surgeon 128 by addingextra graphics to the augmented view or highlighting existing graphicsthat is being displayed as part of the augmented view.

In embodiments, virtual reality headsets are used in combination with a3D model of the surgeon 128 and/or the other users (e.g. obtained with a3D scanner), and/or a motion capture device (e.g. time-of-flight camera)pointing at each user interacting with the system, and appropriatesoftware (that e.g. uses a 3D virtual anatomical model of a person witharticulated and moveable joints, blending with it the available 3Dsurface models of the users) to show the users within the field of viewof the surgeon 128, and/or the surgeon 128 within the field of view ofthe users, in real time during surgery, showing their movements, e.g.movements of their hands, for example with real or virtual instrumentsor devices, over the target portion of the patient 118. In this manner,any user is able to interact directly with the surgeon 128 as a 3Davatar in his field of view in real time during surgery, through thevirtual reality display, without the need to share the same room.

In embodiments, a notification system 348 is developed preoperatively.Examples of notifications include, but are not limited to, an alert whenapproaching certain zones or depth layers (or a location matching both,a zone and a depth layer) of the 3D volume or 3D atlas, to avoid certainimportant structures at risk of lesion during surgery. Suchnotifications are output visually to the display 126, audibly viaheadphones 219 or a speaker in the operating room, or in any othersuitable manner. Another example includes that, when the instruments 138are near those zones or layers, the preferred notification is displayeddirectly within the surgeon's 128 field of view. The same principle isused to mark the incision lines of the preferred exposures, ortrajectory for instruments or devices 138 (e.g. pins or screws),according to previously defined data, similar to a GPS-based autonavigation system.

In another example, information of nearby structures and their distanceand precise position is displayed graphically to the surgeon's 128 fieldof view, according to the automated calculations made by computer means100, following the principles of the present invention, e.g. whilecarrying out a dissection, or while reducing a fracture, or positioningan implant. Another example involves using color codes for certainalarms displayed, wherein graphical representations of the instrumentsor devices 138 used may be turned red when approaching vital structuresor when moving away from the desired rotation or location, or green whenfollowing the preoperative planning, or when moving closer to thedesired rotation or location.

In embodiments, automatic measurements are made and instantly displayedto the surgeon 128 by computer means 100 from the stereoscopic videoimages 116 or a combination of images taken from some or all of thedevices from the stereoscopic camera system 114 and/or 3D scanner system110, e.g. by range imaging techniques (e.g. structure-from-motion), orstereophotogrammetric techniques. These automatic measurements ofportions of the patient 118 and/or instruments and devices 138, may helpwith accuracy of depth of penetration of instruments or devices 138 intothe target portion of the patient 118, or determining the current poseand pose changes of the patient 318, etc. Using the same principles inX-rays, with markerless or marker-based Roentgen stereophotogrammetry(with stereoscopic fluoroscopic images), further adjustments are madefor the correct positioning of e.g. instruments and devices 138, ordetermining patient pose 318, or positioning of fracture fragmentsreduced during surgery, etc.

In embodiments, preoperative planning done by software means is shown inreal time blended with the preoperative 102 and intraoperative images106 or their graphical representation, e.g. blended with the 3D volume,with the proposed measurements, angles, incisions, osteotomies, etc.drawn and marked. The planning is done and displayed as virtualgraphics, either as a 2D drawing or design, or as 3D representation inany of the available file (e.g. STL) or video formats, in stereoscopicmanner or not. For example, the appropriate step in surgical techniqueor guide (from the manufacturer of equipment, or self-made by thesurgeon 128 or other users) is shown each time a predefined previousstep is completed or skipped (as interpreted automatically by computermeans 100, or indicated by the surgeon 128 through real-time interfacemeans 132): graphical representations of the correct or possibleinstruments and devices 138 are shown, or marked when on the surgeon's128 field of view; the incorrect ones are marked (e.g. with colors);possible alternative steps and instruments are shown. Thisintraoperative help includes e.g. any aspect of the surgery ortechnique, such as plates, screws, sutures, or any other instruments ordevices 138, their different sizes, shapes, materials, or a combinationof them, available as virtual graphical representations.

In embodiments, augmented reality helps predict the outcome of aprocedure. As one example, a virtual model of a flap drawn and displayedstereoscopically blended with the patient's 118 3D surface or 3D volumecan be manipulated virtually by the surgeon 128 within his or her fieldof view, for demonstrating potential outcomes of microsurgery, by addingshape-mapped, scale-mapped, and texture-mapped images blended with thetarget donor or acceptor portion of the patient 118, or both.

In embodiments, graphical representations (e.g. digital 2D or 3Dtemplates) of instruments and devices 138 (e.g. plates or nails) aredisplayed stereoscopically and “tried” virtually in real time blendedwith any of the images displayed to the surgeon 128. For example, once afracture is reduced or certain steps in surgery have been accomplished,virtual representations of e.g. a plate are virtually tried in theprocessed images 124 displayed to the surgeon 128 (e.g. blended with the3D volume and stereoscopic video 116 according to the principles of thepresent invention), so that the surgeon 128 does not need to try thereal plates (or needs to try less plates) over the fracture fragments,hence reducing surgery time and avoiding complications derived from e.g.making a bigger incision, stripping more periosteum for exposure, etc.In embodiments, automated measurements and calculations of instrumentsand devices 138 are made by computer means 100 and displayed to thesurgeon 128 in the same manner. For example, when broaching or reamingbone, or when inserting pins, their estimated depth within bone isautomatically calculated, so that measuring of depth with mechanicaldevices, for estimation of the size of the definitive nail or screw tobe inserted, is not necessary.

In embodiments, computer vision algorithms are used for real-timeaugmented reality help 134. Thus, computer means 100 display directly tothe surgeon 128 e.g. which fracture fragments may correspond with whichby automatic segmentation software for bone contouring (or e.g. byregistration with an image representing the healthy structure, asdescribed above for a fracture involving a hemipelvis), which locationand rotation is ideal for arthroplasty or ligament plasties, whichlayers and borders of the wound correspond with which (to close thewound more precisely), and so forth. As another example, computer visionsoftware helps in classification (as a statistical image-basedclassification method, as described in the different embodiments above),e.g. detecting soft tissue or bony lesions, by patterns of typicalfractures, or patterns of typical tendinous or ligamentous pathology.

In embodiments, 3D volume analysis is done comparing preoperative andintraoperative 3D surfaces obtained from 3D scans of the patient 118, toevaluate for deformity correction during surgery, for example a thoraxsurface reconstruction is used for pectus excavatum or pectus carinatumcorrection, cervico-thoraco-lumbo-sacral surface reconstruction is usedfor scoliosis correction, or pelvic and limb surface reconstruction isused for limb deformity correction.

In embodiments, the stereoscopic camera system 114 has a digital oroptical zoom feature that can be utilized during surgery according tothe surgeon's 128 needs, to see a magnified augmented view, interactingwith the computer 100 through the available user interface means 132.Their precision is enhanced by software (e.g. color, contrast) andexternal (e.g. light, camera loupes) means as necessary, to more clearlyappreciate the anatomy, thus eliminating or reducing the need for otherexternal devices, like surgical loupes or microscopes. The zoom valuesare also applied to the blended images (e.g. 3D volume, graphicalrepresentations) displayed automatically by computer means 100.

In embodiments, the surgeon 128 may input information regarding thecondition of the patient 118 for storage with the preoperative images102. Such information may include, but is not limited to, informationrelated to the diagnosis of the patient 118, and information related toa surgical procedure to be performed on the patient 118. With suchinformation, real-time video images 116 may be compared with thepreoperative images 104, 108 or the graphical representation of them(e.g. 3D volume) to determine whether the observed surgical images matchexpected surgical images based upon the patient condition information.Further, notifications may be generated and output based upon suchcomparisons. For example, in embodiments, the 3D surface may be comparedto such data used to determine whether a surgery is performed on acorrect body part or side. As a more specific example, the 3D volume—3Dsurface registration acquired during surgery may be analyzed todetermine that a surgeon 128 is operating on a particular limb of apatient 118, on the correct vertebra or bone within the target portionof the patient 118. Such information may be compared tocondition-related information to determine whether the procedure isbeing performed on the correct limb, target portion or level of theportion of the patient 118.

In embodiments, surgical or technical videos (stereoscopic or not)stored in the computer 100 or streamed from the intranet or Internet maybe displayed directly to the surgeon's 128 field of view, blended withthe other images displayed, at his or her own discretion (throughreal-time user interface means 132). Other examples of images that maybe displayed include images from surgical atlas, book pages andillustrations, surgical techniques and guides, etc. Therefore, virtuallyanything available in digital format that helps the surgeon 128 outsidethe operating room may be displayed in real time during surgery in thesurgeon's 128 field of view, according to this invention.

In embodiments, arthroscopy is done with the video images from thearthroscope displayed directly to the surgeon 128 in his own field ofview. For example, the images displayed include registered (external)stereoscopic video 116, 3D surface, 3D volume, and other digital images108, as well as any augmented reality help 134.

In case the arthroscopic imaging is stereoscopic, each image isdisplayed to the corresponding eye of the surgeon 128 through thestereoscopic display 126. Through tracking means 136 on the arthroscope,the principles of this invention are applied for automated registrationof internal (arthroscopic) stereoscopic video with 3D surface, 3Dvolume, digital images 108, and external stereoscopic video 116.Accordingly, stereoscopic video images of the external surface andinternal structures may be displayed, either alone or combined in thesurgeon's 128 field of view, registered and blended with the availablepreoperative 102 or intraoperative 106 images. Augmented view examplesinclude its use to observe how potential stitches (using the availableimages, such as the 3D volume rendering 104) may interact with otheranatomy, and to choose stitch locations based upon such demonstrations.Another example involves the use of image registration and augmentedreality help to achieve the best possible position for tibial andfemoral tunnels during ligamentoplasty.

In case of ultrasound and ultrasound-guided surgery, ultrasound imagesmay undergo automatic registration and be blended with the other imagesavailable, or may be displayed directly to the surgeon's field of view128. An example includes tracking the ultrasound probe positioning,which is done with any combination of the aforementioned tracking means136, to allow for an automatic registration of stereoscopic video 116with the ultrasound images, positioning the ultrasound images on theprecise plane over the target portion of the patient 118, with theimages displayed as viewed from a virtual camera sharing the locationand orientation angles of the stereoscopic camera system 114, blendedaccording to the principles of the present invention.

As examples of robotic feedback, gloves and other hand-wearable devicesare used for active motion, and also for passive feeling. Inembodiments, “active” wearable gloves are used during surgery, limitingthe movement of fingers, hand, and wrist of the surgeon 128, byaugmenting resistance, or even completely blocking movement of thejoints, when the instruments and devices 138 approach predeterminedstructures at risk. The computer 100 sends the signal to the gloves toblock flexion (and/or extension) of the surgeon's 128 hand and wristjoints when getting closer e.g. to the incision's border or depthplanned, the limits of a tumor, when using scalpels, saws, broaches, orany other instrument or device 138, and having a predefined workspacefor them, or when approaching structures at risk. As another example, in“passive” hand-wearable devices, computer means 100 interpret e.g. thepressure done by the surgeon's 128 fingers and joints while operating(e.g. while dissecting, or broaching), e.g. with pressure sensors, andcompares such pressure with the “normal” pressure in the current andnearby layers (or from statistical pressure range from the differenttissues), to determine if the instruments are near to structures atrisk, to display alert notifications to the field of view of the surgeon128, or directly send blocking feedback to the same hand-wearable devicewith “active” capabilities.

The same principles described above are used in case of a fullyautomated robotic device, where the surgeon 128 may operate withoutphysically being in the same room as the patient 118.

In embodiments, the principles of this invention are used as a surgicaltraining system, wherein surgery is done over an object, instead of aportion of the real patient 118. For example, to train the surgicalskills in spine surgery, the surgeon 128 may operate on an object, e.g.a surgical phantom of a trunk, or the trunk of a cadaver donor, or deadbiological tissue with a similar form to a human trunk, or any othersuitable object. Registration is done between the available 3Danatomical image (e.g. 3D volume obtained from a CT scan of the trunk ofa real patient) and the 3D surface of the real object (e.g. a surgicalphantom of a trunk). The 3D volume—3D surface registration 120, and theregistration of stereoscopic video—3D surface registration 122 areadjusted before surgery or during surgery by the surgeon 128 or otherusers through user interface means 130, 132, according to the principlesof this invention, to obtain the best possible blending of bothstructures, the phantom and the preoperative images 102. Alternatively,or in combination with the embodiment above, the 3D surface of the realpatient 118 (to whom the preoperative images 102 correspond) is used foran intermediate registration of 3D (phantom) surface—3D (patient)surface, to offer a more precise registration of stereoscopic video 116and 3D volume. Stereoscopic video 116 is blended with the 3D anatomicalimages (e.g. the 3D volume, or a 3D anatomic atlas), following theprinciples of this invention, and the stereoscopic display 126 (e.g.virtual reality display) combined with the different augmented realityhelps 134 already described in the different embodiments offer thus anaugmented reality environment. In this augmented reality environment,the surgeon 128 is able to perform surgery on objects, while seeing realexternal and internal anatomic structures. In embodiments, the virtualsurgery performed (as seen on the display 126) is recorded in videoformat, and the virtual stereoscopic result of the surgery is stored asvirtual graphics any suitable file format (e.g. STL), whereby thesurgeon 128 and other users may study the end result of surgery, e.g.the incision made, the reduction of fracture fragments, the positioningof screws, etc.

In embodiments, the augmented reality training system in accordance tothe principles of this invention is used in combination with 3D printingof soft tissues and bone of the target portion of the real patient 118.The 3D printing is based e.g. on the 3D volume obtained from CT and/orMR scans of the patient 118, to obtain a surgical phantom that sharesthe same shape and size as the real patient 118. The phantom printedincludes the structures selected by the user through computer interfacemeans, and these structures are printed in the preferred materials. Inthis manner, any surgery that has to be done on the real patient 118 maybe trained beforehand, according to the principles of this invention,over phantoms that reproduce in detail the target portions of thepatient 118. In embodiments, the 3D surface obtained from the realpatient 118 is used for image registration with the 3D printed phantom,as already described. In embodiments, phantoms that are 3D printed fromreal patients are used for training of similar surgical cases, even ifthe phantom does not exactly correspond to the actual patient 118 thatwill undergo surgery. The stored video of the surgery, or the storedgraphical representation of the end result (e.g. in STL file format), asdescribed above, may thus be used for preoperative planning, fordisplaying notifications, for identifying structures at risk, etc. tothe surgeon 128, during further surgical training on the same or asimilar object, or during real surgery to the patient 118, according tothe different embodiments described.

In embodiments, the above described methods and processes may be tied toa computing system including one or more computers. Examples of suchcomputing systems may include, but are not limited to, imaging devices102, 104, computing system 100, 3D scanner system 110, stereo camerasystem 114, tracking means 136, stereo display 126. In particular, themethods and processes described herein may be implemented as a computerapplication, computer service, computer API, computer library, and/orother computer program product.

FIG. 6 schematically shows a nonlimiting computing system 600 that mayperform one or more of the above described methods and processes.Computing system 600 is shown in simplified form. It is to be understoodthat virtually any computer architecture may be used without departingfrom the scope of this disclosure. In different embodiments, computingsystem 600 may take the form of a mainframe computer, server computer,desktop computer, laptop computer, tablet computer, home entertainmentcomputer, network computing device, mobile computing device, mobilecommunication device, gaming device, etc.

Computing system 600 includes a logic subsystem 602 and a data-holdingsubsystem 604. Computing system 600 may optionally include a displaysubsystem 606, communication subsystem 608, and/or other components notshown in FIG. 6. Computing system 600 may also optionally include userinput devices such as keyboards, mice, game controllers, cameras,microphones, touch screens, gesture and/or voice recognition devices,for example.

Logic subsystem 602 may include one or more physical devices configuredto execute one or more instructions. For example, logic subsystem 602may be configured to execute one or more instructions that are part ofone or more applications, services, programs, routines, libraries,objects, components, data structures, or other logical constructs. Suchinstructions may be implemented to perform a task, implement a datatype, transform the state of one or more devices, or otherwise arrive ata desired result.

Logic subsystem 602 may include one or more processors that areconfigured to execute software instructions. Additionally oralternatively, logic subsystem 602 may include one or more hardware orfirmware logic machines configured to execute hardware or firmwareinstructions. Processors of logic subsystem 602 may be single core ormulticore, and the programs executed thereon may be configured forparallel or distributed processing. Logic subsystem 602 may optionallyinclude individual components that are distributed throughout two ormore devices, which may be remotely located and/or configured forcoordinated processing. One or more aspects of logic subsystem 602 maybe virtualized and executed by remotely accessible networked computingdevices configured in a cloud computing configuration.

Data-holding subsystem 604 may include one or more physical,non-transitory, devices configured to hold data and/or instructionsexecutable by logic subsystem 602 to implement the herein describedmethods and processes. When such methods and processes are implemented,the state of data-holding subsystem 604 may be transformed (e.g., tohold different data).

Data-holding subsystem 604 may include removable media and/or built-indevices. Data-holding subsystem 604 may include optical memory devices(e.g., CD, DVD, HD-DVD, Blu-Ray Disc, etc.), semiconductor memorydevices (e.g., RAM, EPROM, EEPROM, etc.) and/or magnetic memory devices(e.g., hard disk drive, floppy disk drive, tape drive, MRAM, etc.),among others. Data-holding subsystem 604 may include devices with one ormore of the following characteristics: volatile, nonvolatile, dynamic,static, read/write, read-only, random access, sequential access,location addressable, file addressable, and content addressable. Inembodiments, logic subsystem 602 and data-holding subsystem 604 may beintegrated into one or more common devices, such as an applicationspecific integrated circuit or a system on a chip.

FIG. 6 also shows an aspect of the data-holding subsystem in the form ofremovable computer-readable storage media 610, which may be used tostore and/or transfer data and/or instructions executable to implementthe herein described methods and processes. Removable computer-readablestorage media 610 may take the form of CDs, DVDs, HD-DVDs, Blu-RayDiscs, EEPROMs, and/or floppy disks, among others.

It is to be appreciated that data-holding subsystem 604 includes one ormore physical, non-transitory devices. In contrast, in embodimentsaspects of the instructions described herein may be propagated in atransitory fashion by a pure signal (e.g., an electromagnetic signal, anoptical signal, etc.) that is not held by a physical device for at leasta finite duration. Furthermore, data and/or other forms of informationpertaining to the present disclosure may be propagated by a pure signal.

The terms “module,” “program,” and “engine” may be used to describe anaspect of computing system 600 that is implemented to perform one ormore particular functions. In some cases, such a module, program, orengine may be instantiated via logic subsystem 602 executinginstructions held by data-holding subsystem 604. It is to be understoodthat different modules, programs, and/or engines may be instantiatedfrom the same application, service, code block, object, library,routine, API, function, etc. Likewise, the same module, program, and/orengine may be instantiated by different applications, services, codeblocks, objects, routines, APIs, functions, etc. The terms “module,”“program,” and “engine” are meant to encompass individual or groups ofexecutable files, data files, libraries, drivers, scripts, databaserecords, etc.

It is to be appreciated that a “service,” as used herein, may be anapplication program executable across multiple user sessions andavailable to one or more system components, programs, and/or otherservices. In some implementations, a service may run on a serverresponsive to a request from a client.

When included, display subsystem 606 may be used to present a visualrepresentation of data held by data-holding subsystem 604. As the hereindescribed methods and processes change the data held by the data-holdingsubsystem, and thus transform the state of the data-holding subsystem,the state of display subsystem 606 may likewise be transformed tovisually represent changes in the underlying data. Display subsystem 606may include one or more display devices utilizing virtually any type oftechnology. Such display devices may be combined with logic subsystem602 and/or data-holding subsystem 604 in a shared enclosure, or suchdisplay devices may be peripheral display devices.

When included, communication subsystem 608 may be configured tocommunicatively couple computing system 600 with one or more othercomputing devices. Communication subsystem 608 may include wired and/orwireless communication devices compatible with one or more differentcommunication protocols. As nonlimiting examples, the communicationsubsystem may be configured for communication via a wireless telephonenetwork, a wireless local area network, a wired local area network, awireless wide area network, a wired wide area network, etc. Inembodiments, the communication subsystem may allow computing system 600to send and/or receive messages to and/or from other devices via anetwork such as the Internet.

In embodiments, the stereoscopic video image 116 is taken as theadaptable basic view of the surgeon 128 or surgeons, and may follow thesurgeon's 128 head movements, or be independent of his or her positionin the operating room. Digital images and virtual graphics blended withthe stereoscopic video 116 enhance the displayed image, and may evensubstitute it completely, through image processing 124, which mayprovide a fully customizable view to the surgeon 128.

3D scanner devices 112 described in embodiments may offer anintermediate step for registration between stereoscopic video 116 andthe graphical representation (e.g. 3D volume) of a preoperative 102 orintraoperative image 106, which allows for a more accurate, quicker,real-time image registration with automatic patient 118 pose changeadaptation, within a simple image-guided navigation system, using theavailable markerless or marker-based registration methods. The surgeon's128 viewpoint (the stereoscopic camera system 114) in turn, isindependent from the 3D scanner system 112, and can be fixed or dynamic.Accordingly, embodiments may be a more cost effective, in initialinvestment as well as in fungibles (e.g. markers). Embodiments may makeuse of anatomical models and volume rendering 104 of individualizedparts allowing for a precise registration of pose changes of the patient118 to the 3D volume 320, by determining pose changes in real time 318and translating them into a predefined 3D anatomical model with moveableinner structures. Image processing 124 (including the precise blendingof the available images) and augmented reality help 134 are adjustedduring surgery through real-time user interaction 132 e.g. throughgesture recognition (without the need to physically touch interfacemeans), outpacing the current limitations of the available navigationsystems.

Tracking of imaging devices 106, patient 118, instruments and devices138, is done with optical marker-based or markerless means, e.g. withtracker cameras forming part of the 3D scanner system 110. Tracking datais made available as virtual graphics directly into the surgeon's view128 in a stereoscopic way, offering a direct, precise and intuitiveguide for positioning of instruments and devices 138 during surgery.Preoperative 102 and intraoperative images 106 are also displayeddirectly to the surgeon's 128 field of view, and stereoscopic viewsoffer more accurate representations of the available digital and virtual(e.g. 3D volume) images during surgery. All preoperative 102 andintraoperative images 106 may tracked to the current pose of the patient118 during surgery, according to the principles of this invention.

With this navigation system there is real-time interaction of thesurgeon 128 with software-based tasks through the general 130 orreal-time user interface means 134, e.g. by gesture or voicerecognition. Gestures are recognized by motion tracking software, e.g.from images by the 3D scanner device 110, or the stereoscopic display126, or from stereoscopic video images 116. That makes the navigationsystem even more accurate, with instant adjustments made by the surgeon128 or other users to image processing 124, e.g. to the registration ofimages, or to the augmented reality help 134 displayed. Computer means100 in accordance with this invention, and especially with thestereoscopic display 126, offer a wide range of augmented realitypossibilities that are of great help to the surgeon 128 during surgery.

As already described in the embodiments above, stereoscopic video 116may be sent directly to the display 126, and then undergo registrationin the computer 100 being blended with the other images available,making the real time lag between actual scene and stereoscopic video 116negligible. Also, as already described in the embodiments above, changesin position of the patient 118, instruments and devices 138 and/orsurgeon's 128 head are tracked, so that the perspective view of thevirtual graphics composing the augmented reality view change in realtime. Therefore, images available to computer means 100 (e.g. 3Dsurface, 3D volume, or virtual representation of instruments or devices138) may adjust more quickly to changes in the position of the patient118, instruments or devices 138, or the surgeon's 128 head thanregistered stereoscopic video (that needs to pass from the cameras 114to the computer 100 as digital video 116, and then processed 124 forregistration), or even than stereoscopic video 116 sent directly to thedisplay 126. When the available digital images (e.g. 3D surface, 3Dvolume) and virtual graphics are sent directly to the display 126 (oncethe initial registration of images has been done), without blending withthe stereoscopic video 116, the time lag between the real scene and thesurgeon's 128 view is still less appreciable.

Other embodiments may be utilized in combination with an opticalsee-through display, tracking the surgeon's 128 head, using any of thecommercially available devices for this task (e.g. optical see-throughglasses). Alternatively, a stereoscopic display 126 is used that worksas a video see-through device during the surgery, but turns into anoptical see-through display, thanks to the percentage of transparencyapplied to the display's special glasses. Alternatively, a projector isused that projects the processed images 124 directly to the targetportion of the patient 118. Alternatively, the surgeon 128 may ask for achange of display during surgery, having them both head-mounted, orbeing helped by another member of the surgical team who changes them,whenever the surgeon 128 feels the need for a lag-less vision.

While the foregoing written description of the invention enables one ofordinary skill to make and use what is considered presently to be thebest mode thereof, those of ordinary skill will understand andappreciate the existence of variations, combinations, and equivalents ofthe specific embodiment, method, and examples herein. The inventionshould therefore not be limited by the above described embodiment,method, and examples, but by all embodiments and methods within thescope and spirit of the invention as claimed.

What is claimed is:
 1. A system comprising: a stereoscopic opticalsee-through head mounted display; a physical instrument or physicaldevice; at least one computing system; and at least one camera, whereinthe at least one computing system is configured to receive preoperative,intraoperative, or preoperative and intraoperative image information ofa physical spine of a patient, wherein the at least one computing systemis configured to track, using the at least one camera, a first markerconfigured to be attached to a bony structure of the physical spine anda second marker configured to be attached to the physical instrument orthe physical device, wherein the at least one computing system isconfigured to generate a three-dimensional (3D) surface representationof at least a portion of the tracked physical instrument or physicaldevice, wherein the at least one computing system is configured togenerate a 3D stereoscopic view, displayed by the stereoscopic opticalsee-through head mounted display, the 3D stereoscopic view comprisingthe 3D surface representation of the at least portion of the trackedphysical instrument or physical device, wherein the system is configuredto adjust in real time the 3D stereoscopic view responsive to movementof the stereoscopic optical see-through head mounted display, whereinthe system is configured to adjust in real time the 3D stereoscopic viewresponsive to movement of the physical spine from a first position to asecond position, wherein the at least one computing system is configuredto superimpose the 3D stereoscopic view on a target portion of thephysical spine, and wherein the first and second positions of thephysical spine of the patient are different.
 2. The system of claim 1,wherein the at least one camera is head mounted with the stereoscopicoptical see-through head mounted display.
 3. The system of claim 1,wherein the at least one camera is separate from the stereoscopicoptical see-through head mounted display.
 4. The system of claim 1,wherein the computing system is configured to update the location andorientation of the 3D surface representation in real-time in thestereoscopic optical see-through head mounted display.
 5. The system ofclaim 1, wherein the 3D surface representation comprises a virtualtrajectory for the physical instrument or physical device, a virtualpin, a virtual screw, a virtual nail, a virtual plate, a virtualtemplate of the physical instrument, a virtual template of the physicaldevice, or any combinations of the foregoing.
 6. The system of claim 1,wherein the position and orientation of the 3D surface representation ofthe at least portion of the tracked physical instrument or physicaldevice is updated in real time based on the position and orientation ofthe second marker.
 7. The system of claim 1, wherein the systemcomprises at least one inertial measurement unit, and wherein the systemis configured to adjust in real time the at least one 3D stereoscopicview responsive to movement of the stereoscopic optical see-through headmounted display using the at least one an inertial measurement unit. 8.The system of claim 1, wherein the first marker comprises at least oneoptical marker and at least one radiopaque marker.
 9. The system ofclaim 8, wherein the at least one radiopaque marker is included in theimage information of the physical spine of the patient.
 10. The systemof claim 9, wherein the at least one computing system is configured toregister the image information with the physical spine using the atleast one radiopaque marker.
 11. The system of claim 8, wherein theoptical marker is a color marker, a reflective marker, or a color markerand a reflective marker.
 12. The system of claim 1, wherein the 3Dstereoscopic view comprises a 3D surface representation of an anatomicalmodel, the image information of the physical spine of the patient,anatomic data derived from the image information of the physical spineof the patient or a combination thereof.
 13. The system of claim 12,wherein the at least one computing system is configured to generate the3D surface representation of the anatomical model, the image informationof the physical spine of the patient, the anatomic data derived from theimage information of the physical spine of the patient or thecombination thereof based on at least one 2D image, at least one 3Dimage, or at least one 2D and at least one 3D image of the physicalspine, wherein the at least one 2D image, at least one 3D image orcombination thereof comprises a CT image, an MRI image, an ultrasoundimage, an x-ray image or combinations thereof.
 14. The system of claim1, wherein the first marker comprises two or more markers configured tobe attached to a bony structure of the physical spine of the patient, orwherein the second marker comprises two or more markers configured to beattached to the physical instrument or the physical device, or whereinthe first marker comprises two or more markers configured to be attachedto a bony structure of the physical spine of the patient and wherein thesecond marker comprises two or more markers configured to be attached tothe physical instrument or the physical device.
 15. The system of claim1, wherein the at least one computing system is configured to determinein real time the relative location, orientation or location andorientation of the physical instrument or the physical device, the firstmarker, the second marker, the physical spine, a surgeon, thestereoscopic optical see-through head mounted display, or anycombinations of the foregoing.
 16. The system of claim 1, wherein the atleast one computing system comprises the computing system configured toreceive preoperative, intraoperative, or preoperative and intraoperativeimage information of the physical spine of the patient, the computingsystem configured to track, using the at least one camera, the firstmarker configured to be attached to the bony structure of the physicalspine and the second marker configured to be attached to the physicalinstrument or the physical device, the computing system configured togenerate the 3D surface representation of the at least portion of thetracked physical instrument or physical device, the computing systemconfigured to generate the 3D stereoscopic view, displayed by thestereoscopic optical see-through head mounted display, the computingsystem configured to superimpose the 3D stereoscopic view on a targetportion of the physical spine, and wherein one or more of the at leastone computing system are the same.
 17. The system of claim 1, whereinthe at least one computing system comprises the computing systemconfigured to receive preoperative, intraoperative, or preoperative andintraoperative image information of the physical spine of the patient,the computing system configured to track, using the at least one camera,the first marker configured to be attached to the bony structure of thephysical spine and the second marker configured to be attached to thephysical instrument or the physical device, the computing systemconfigured to generate the 3D surface representation of the at leastportion of the tracked physical instrument or physical device, thecomputing system configured to generate the 3D stereoscopic view,displayed by the stereoscopic optical see-through head mounted display,the computing system configured to superimpose the 3D stereoscopic viewon a target portion of the physical spine, and wherein one or more ofthe at least one computing system are different.
 18. The system of claim1, wherein the at least one computing system is configured to providefor adjustment of transparency, color, contrast or combinations thereofof the 3D stereoscopic view displayed by the stereoscopic opticalsee-through head mounted display superimposed on the target portion ofthe physical spine.
 19. The system of claim 12, wherein the 3D surfacerepresentation of the anatomical model, the image information of thephysical spine of the patient, the anatomic data derived from the imageinformation of the physical spine of the patient or the combinationthereof comprises a 3D surface representation of one or moreindividualized vertebrae.
 20. The system of claim 1, comprising at leastone passive optical marker, active optical marker, inertial measurementunit, magnetic tracking device, electromagnetic tracking device,ultrasonic tracking device, mechanical tracking device, tracking deviceusing inertial measurement units, 3D scanner, camera, navigation systemor a combination thereof configured to track the physical instrument orthe physical device, the physical spine, a surgeon, the opticalsee-through head mounted display or a combination thereof.
 21. Thesystem of claim 1, wherein the movement from the first position to thesecond position of the physical spine during surgery is a rotation or alateralization of at least one vertebrae.
 22. The system of claim 1,wherein the at least one computing system is configured to obtaincoordinate information of the physical instrument or the physicaldevice, the first marker, the second marker, the physical spine, asurgeon, the stereoscopic optical see-through head mounted display or acombination thereof in a coordinate system.
 23. The system of claim 22,wherein the coordinate system is the coordinate system of the at leastone camera.
 24. The system of claim 22, wherein the coordinate system isthe coordinate system of the first marker.
 25. The system of claim 1,wherein the at least one camera is configured to use visible lightinformation.
 26. The system of claim 1, wherein the at least one camerais configured to use infrared light information.
 27. The system of claim1, wherein the system is configured to generate the 3D surfacerepresentation of the at least portion of the tracked physicalinstrument or physical device from at least one point cloud, mesh, or acombination thereof.
 28. The system of claim 12, wherein the 3D surfacerepresentation of the at least portion of the tracked physicalinstrument or physical device is stored in STL file format.
 29. Thesystem of claim 1, further comprising a 3D scanner, a video system or a3D scanner and a video system configured to track the movement from thefirst position to the second position of the physical spine.